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(57) Abstract: A method for constructing stable bioactive fusion proteins of the difficult to express tumor necrosis factor superfamily 
(TNFSF), and particularly members CD40L (CD154) and RANKL/TRANCE, with collectins, particularly pulmonary surfactant 
protein D (SPD) is described. Single trimers of these proteins lack the full stimulatory efficacy of the natural membrane forms of 
these proteins in many cases. The multimeric nature of these soluble fusion proteins enables them to engage multiple receptors on 
the responding cells, thereby, mimicking the effects of the membrane forms of these ligands. For CD40L-SPD, the resulting protein 
stimulates B cells, macrophages, and dendritic cells, indicating its potential usefulness as a vaccine adjuvant. The large size of these 
fusion proteins makes them less likely to diffuse into the circulation, thereby limiting their potential systemic toxicity. This property 
may be especially useful when these proteins are injected locally as a vaccine adjuvant or tumor immunotherapy agent to prevent 
them from diffusing away. In addition, these and other TNFSF-collecting fusion proteins present new possibilities for the expression 
of highly active, multimeric, soluble TNFSF members. 
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BACKGROUND OF THE INVENTION 



Field of the Invention 

The present invention relates to a method form preparing soluble 
multimeric proteins consisting of more than three iterations of the same bioactive 
10 molecule using recombinant DNA technology. 



multimeric fusion proteins involving the TNF superfamily (TNFSF) members as 
a fusion proteins with SPD, and more specifically, CD40L-SPD fusion proteins 
1 5 and useful modifications thereof. 

Description of Related Art 

Numerous proteins can be made using modern molecular biology 
techniques and used in diagnostic and therapeutic applications. Using 

20 recombinant DNA techniques, the DNA encoding a single amino acid chain is 
constructed and then introduced into a cell which manufactures the final protein. 
Some cells, especially bacteria like E. coli, lack the ability to properly fold the 
amino acid chains into the proper quaternary structure and they often fail to apply 
the necessary modifications (e.g., glycosylation and disulfide bond formation) that 

25 are needed for the protein to be bioactive and resistant to degradation in vivo. 

While most of these challenges can be met by expressing the amino acid chain in 
eukaryotic cells like yeast or mammalian cells in vitro, it is not always 
straightforward to express proteins that consist of two or more amino acid chains. 
In general, for multichain proteins, the single amino acid chains must associate 

30 together in some way either within the producer cell or subsequently after the 



The present invention particularly concerns a new method of producing 
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monomers are secreted from the producer cell. For artificially constructed 
molecules, the introduction into a single amino acid chain of an amino acid 
sequence which causes this chain-to-chain association can be an important step in 



One of the most widely used methods of causing two amino acid chains to 
associate is to conjoin, at the DNA coding level, segments from the protein of 
interest and a segment from a spontaneously dimerizing protein. The best 
example is to conjoin or fuse a protein with the Fc portion of immunoglobulin, 
creating a dimeric Fc fusion protein (Fanslow et al 9 J. Immunol 136:4099, 1986). 
A protein of this type can be formed from the extracellular domain of a tumor 
necrosis factor receptor fused to Fc (termed etanercept and marketed as 
ENBREL®), which is effective in the treatment of rheumatoid arthritis. A second 
example is the construction of a fusion protein between the dimerizing 
extracellular portion of CD8 with the extracellular portion of CD40L 
(Hollenbaugh et ah, EMBO J. 1 1:4313, 1 992). Here, the dimerizing CD8 portion 
of the fusion protein helps to maintain the CD40L portion in the trimeric form 
needed for its bioactivity. A more recent example is the addition of an isoleucine 
zipper motif to CD40L, which permits the -production of trimeric soluble CD40L 
molecules (Morris etal 0 J. Biol Chem. 274:418, 1999). 

The TNF superfamily (TNFSF) consists of an expanding number of 
proteins (see Table I) which are crucial for the development and functioning of 
the immune, hematological, and skeletal systems. TNFSF proteins are ligands for 
a corresponding set of receptors of the TNF receptor superfamily (TNFRSF). All 
TNFSF members are expressed as Type II membrane proteins, with the exception 
of lymphotoxin-alpha which is produced as a secreted protein. However, soluble 
forms of several TNFSF proteins can be released from the cell surface by 
proteolytic cleavage, usually by specific metal loproteinases. 



producing multichain proteins. 
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The production of soluble forms of TNFSF proteins has been an important 
step in the study of these proteins. Soluble TNFSF ligands can be used to study 
the activities of these proteins in vitro without the complexities in interpretation 
5 that result when cells or cellular membranes expressing TNFSF proteins are 
added. In addition, soluble forms of several TNFSF proteins have potential as 
therapeutic agents for human diseases. In particular, TNF-D has been extensively 
studied for the treatment of cancer and soluble CD40L is currently undergoing 
clinical trials to assess its antitumor effects. 

10 

To produce soluble forms of TNFSF proteins, either the membrane protein 
is expressed in a cell line possessing a protease capable of separating the TNFSF 
extracellular domain from the transmembrane domain or a truncated form of the 
TNFSF protein is produced which consists solely of the extracellular domain plus 

15 a signal sequence. In either case, certain soluble forms of TNFSF proteins are 
unstable in solution as simple homotrimers composed solely of the extracellular 
domain. For example, naturally solubilized TNF-D is labile under physiological 
conditions [Schuchmann, 1995 #129]. To solve this stability problem, chimeric 
proteins have been constructed according to one of four different design 

20 principles: ( 1 ) The extracellular portion of the TNFSF protein has been expressed 
fused to the dimeric portion of the immunoglobulin Fc fragment US Patent 
5,155,027, Oct. 13, 1992Jssued to, Andrzej Z. Sledziewski, et al. In the case of 
CD40L and OX40L, this yields a soluble molecule which is significantly less 
active than the native membrane form of this protein. (2) The extracellular 

25 portion of the TNFSF protein has been expressed with an antigenic tag (usually 
the FLAG motif) fused to its N-terminus [Mariani, 1996 ]. The addition of an 
antibody to the tag (e.g., anti-FLAG antibody) aggregates these proteins into a 
multimeric form. Crosslinking enhances activity on B cells. (3) The extracellular 
portion of the TNFSF protein has been expressed fused to the spontaneously 
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dimerizing extracellular portion of the CD8 molecule [Hollenbaugh, 1 992 ] . In the 
case of CD40L, this creates a hexameric molecule [Pullen, 1999 ] which is likely 
formed by two CD40L trimers attached to three CD8 dimeric stalks. Despite this, 
the addition of an anti-CD8 antibody to crosslink the CD40L-CD8 fusion protein 
5 yields a further enhancement of CD40L activity on B cells. (4) The extracellular 
portion of the TNFSF protein has been expressed fused to a trimerizing isoleucine 
zipper which maintains the overall trimeric structure of the protein [US 
Patent5,7 16,805, Feb. 10, 1998, issued to Subashini Srinivasan et al. This soluble 
CD40L trimer or 'sCD40LT' is the form of that protein now being clinically 
10 tested in humans for its anti-tumor effects. 

Compounding the difficulties in producing stable forms of soluble TNFSF 
proteins are compromises in bioactivity. As exemplified by FasL, TNF, and 
CD40L, many of the soluble forms of these proteins lack the full range of 

is stimulatory activities displayed by the membrane forms of these molecules. For 
FasL, several groups have reported that naturally produced soluble FasL 
(generated by proteolytic cleavage from the membrane form) has a spectrum of 
activities that is distinctly different from the membrane form. Soluble FasL 
induces apoptosis in activated CD4+ T cells but not fresh, resting CD4+ T cells. 

20 In contrast, both types of CD4+ T cells are killed by membrane FasL or a 
recombinant soluble form of FasL (WX1) that spontaneously aggregates into 
oligomers larger than a decamer. For TNF, T cell activation through stimulation 
of TNFR II, the 80 kDa receptor for TNF, is much greater with membrane TNF 
than soluble TNF. However, if soluble TNF is produced as a tagged protein and 

25 crosslinked with an antibody against the tag, then it completely mimics the 
activities of membrane TNF [Schneider, 1998]. Finally, for CD40L, the 
stimulatory effects of a soluble form of this TNFSF protein are enhanced by 
crosslinking [Kehry, 1 994] and yields an activity similar to membrane CD40L. 
For example, soluble CD40L-CD8 fusion protein requires crosslinking with a 

4 
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antibody to CD8 in order to drive resting B cells to proliferate to a degree similar 
to membrane-bound CD40L. ). Even more strikingly, although membrane-bound 
CD40L expressed on baculovirus-transduced SF9 insect cells is a strong B cell 
stimulus, small vesicles (10 - 1,500 nm) prepared from the membranes of these 
cells are less stimulatory. However, ultracentrifugation of these vesicles creates 
aggregates which have the full activity of the original membrane CD40L protein. 
This indicates that B cells are more highly stimulated by a large surface of CD40L 
than they are by a smaller surface expressing this membrane ligand. 

Taken together, the above reports suggest that, for some TNFSF/TNFRSF 
ligand/receptor pairs at least, it is essential to cluster receptors together for full 
signaling activity. By this interpretation, the efficacy of the membrane forms of 
FasL, TNF, and CD40L occurs because these ligands can move in the plane of the 
membrane toward the contact zone with a receptor-bearing responding cell, 
thereby clustering ligated receptors to form a receptor-dense region of the 
membrane. This interpretation is further supported by experiments where 
crosslinking of a soluble TNFSF protein effectively mimics the activity of the 
membrane form of the protein [Scheider, 1998]. 

In all of the above examples, no more than three amino acid chains have 
been caused to associate together. There is a need to produce multimeric protein 
molecules where more than three amino acid chains are caused to associate into a 
single soluble molecular complex. An important example comes from studies of 
CD40L (also called CD 154 or TNFSF5), which is a member of the TNF family of 
molecules that are normally expressed as insoluble, cell membrane proteins. It 
has been shown that soluble homotrimers composed of the extracellular regions of 
CD40L, TNF, and FasL are not potently active on resting cells that bear receptors 
for these proteins. However, if these proteins are expressed with a tag on their 
ends (e.g., the FLAG peptide sequence) and then the trimers are extensively 
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crosslinked using an antibody to FLAG, full activity appears (Schneider et al 9 J. 
Exp. Med. 187: 1205, 1998). From this, it can be inferred that the soluble single- 
trimer forms of these molecules does not duplicate the multivalent interactions 
that normally occur when a receptor-bearing cell comes in contact with the 
5 membrane of a cell expressing numerous ligand trimers on its surface. This 

distinction may be due to a need for receptor clustering for full signaling (Bazzoni 
and Beutler, N. Engl J. Med. 334: 1 71 7, 1 996), which in turn is only possible with 
a multimeric ligand engaging many receptors at the same time in a localized 
region of the cell membrane. 



The present invention contemplates a method of preparing soluble, 
multimeric mammalian proteins by culturing a host cell transformed or transfected 
15 with an expression vector encoding a fusion protein comprising the hub, body, 
and neck region of a collectin molecule and a heterologous mammalian protein. 

In one embodiment, the heterologous mammalian protein comprises an 
extra cellular domain of a mammalian transmembrane protein; the resulting fusion 
20 protein forms a multimer. 

In another embodiment, the heterologous mammalian protein comprises a 
soluble protein such as a cytokine; the resulting fusion protein forms a multimer. 

25 In another embodiment, sites of proteolytic degradation are included or 

removed from the fusion protein; the resulting fusion protein forms a multimer 
from which are cleaved single units at a rate made variable by the nature of the 
proteolytic digestion sites either included or excluded. 
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In yet another embodiment, special attention is given to the 
immunogenicity of the fusion protein by altering the junction between the two 
naturally occurring proteins from which it is made; the resulting fusion protein 
may be less or more able to elicit an immune response against itself, which could 
5 lengthen its persistence or contribute to it immunological effectiveness. 



A hybrid nucleotide sequence of no more than 1 528 base pairs including a 
sequence defining a structural gene expressing a conjoined single strand of a 
multimeric TNFSF-SPD fusion protein, said structural gene having a nucleotide 

10 base sequence selected from members of the group consisting of SEQ ID NO 1, 
SEQ ID NO 3 and SEQ ID NO 5 is disclosed by this invention. In one 
embodiment, the DNA segment the structural gene has a sequence expressing a 
single hybrid amino acid chain of TNFSF-SPD, the segment having a first SPD 
nucleotide base sequence of SEQ ID NO 1, from base 32 to base 799, and a 

15 second sequence, expressing a portion of TNFSF stalk, selected from members of 
the group consisting of SEQ ID NO 1, from base 800 to base 1444, SEQ ID NO 3, 
from base 800 to base 1528, and SEQ ID NO 5, from base 800 to base 1441. 



In another embodiment, a recombinant DNA molecule has vector 
20 operatively linked to an exogenous DNA segment defining a structural gene 

expressing a single amino acid chain of TNFSF-SPD. This structural gene has a 
nucleotide base sequence selected from members of the group consisting of SEQ 
ID NO 1, SEQ ID NO 3 and SEQ ID NO 5, any functional equivalents and 
modifications thereof There is also attached an appropriate promoter for driving 
25 the expression of said structural gene in a compatible host organism. The 
organism can be E. coli, a yeast, a higher plant or animal. 



Yet another embodiment contemplated by the invention is multimeric 
TNFSF-SPD fusion protein having a plurality of polypeptide trimers, a first 

7 
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trimer consisting of peptide strands of members of the TNF superfamily (TNFSF) 
of Iigands, and a second trimer strand from a collectin molecule, each first trimer 
conjoined to a second polypeptide trimer strand from a collectin molecule, 
wherein said ligand strand is substituted for native carbohydrate recognition 
5 domains (CRD) of the collectin molecules. The conjoined collectin strands are 
covalently bound in parallel to each other, forming a multimeric fusion protein 
comprising a plurality of trimeric hybrid polypeptide strands radiating from a 
covalently bound center hub of the molecule. The free end of each trimeric 
radiating strand has a TNFSF moiety attached. The TNFSF moiety is one 
10 selected from the group consisting of Iigands LT A, TNF, LTB, and TNFSF4 to 
TNFSF 1 8 as shown in Table II, and their fiintional equivalents, and modifications 
thereof 

The invention also contemplates a method for preparing a CD40-SPD 
15 multimeric fusion polypeptide, including the steps of initiating a culture, in a 
nutrient medium, of procaryotic or eucaryotic host cells transformed with a 
recombinant DNA molecule including an expression vector, appropriate for the 
cells, operatively linked to an exogenous DNA segment defining a structural gene 
for CD40-SPD ligand. The structural gene has a nucleotide base sequence of SEQ 
20 ID NO 1 from about base 32 to about base 1444. Thereafter, the culture is 
maintained for a time period sufficient for the cells to express the multimeric 
molecule. 

Also contemplated is a method of producing a secreted, very large, 
25 biologically active, multimeric tumor necrosis factor superfamily ligand fusion 
protein chimera that is highly immunogenic and not readily diffusable. The steps 
for this method are as follows: 

1 . introducing into a host cell a first chimeric DNA construct including a 
transcriptional promoter operatively linked to a first secretory signal sequence, 

8 
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followed downstream by, and in proper reading frame with a first DNA sequence 
encoding a polypeptide chain of a first TNFSF ligand requiring multimerization 
for biological activity. This sequence is joined to a second DNA sequence 
encoding a collectin polypeptide at the site where the collectings CRD was 
5 purposefully removed. 

2. introducing into the host cell, a second DNA construct including a 
transcriptional promoter operably linked to a second secretory signal sequence 
followed downstream by, and in proper reading frame with, a third DNA sequence 
encoding a second polypeptide chain of a second TNFSF ligand , joined to a 

10 fourth DNA sequence encoding a collectin polypeptide , wherein the collecting 
CRD was purposefully removed, and then, 

3. growing the host cell in an appropriate growth medium under 
physiological conditions to allow the secretion of a large multimerized 
polypeptide fusion protein, wherein the first polypeptide chain of a TNFSF-SPD 

is protein is bound by parallel bonding of the respective collectin domain trimer to 
the second polypeptide chain of a different TNFSF-SPD polypeptide trimer, and 
wherein the multimerized polypeptide fusion protein exhibits biological activity 
characteristic of both membrane-attached TNFSFs, and 

4. isolating the biologically active, multimerized TNFSF-SPD polypeptide 
20 fusion from said host cell. The chimeric reactant compounds are humanized to 

guard against destruction by a potential human recipient's immune system. 

A final method of preparing a multimeric TNFSF-SPD ligand fusion 
protein contemplated requires a) preparing a first DNA segment coding for a 
25 strand of an exposed extracellular portion of TNFSF; b) preparing a second DNA 
segment coding for a collectin polypeptide strand, wherein the collecting CRD 
domain of the strand has been removed; c) conjoining the first and second DNAs 
in proper reading frame, thereby creating a TNFSF-collectin DNA construct; d) 
inserting the construct into an expression vector system; e) introducing the vector 

9 
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system into an appropriate cell in culture under suitable conditions; f) harvesting 
and purifying spent medium from the culture; and finally g) assaying for presence 
of multimeric TNFSF-collectin fusion protein. 



immonocompetent cells using multimeric TNFSF fusion proteins by contacting 
the cells with the multimeric TNFSF fusion proteins, causing the cells to 
proliferate, is also contemplated. The cells used may be resting B cells. There is 
also a method for increasing antigenicity of cells by contacting the cells with the 
10 multimeric TNFSF fusion proteins. In this case, the cells may be tumor cells or 
HTV positive cells. 

Other preferred embodiments contemplate the methods of preparation 
described above, wherein the host transformed is either a prokaryote, such as R 
15 coli, a eukaryote, for example yeast, such as S. cerevisiae . or a higher plant, such 
as alfalfa or tobacco. 

Still further embodiments and advantages of the invention will become 
apparent to those skilled in the art upon reading the entire disclosure contained 
20 herein. 



Figure 1. Structure of the CD40L-SPD fusion protein. The extracellular 
25 portion of the CD40L homotrimer, including its membrane-proximal stalk, was 
fused to the body of SPD. The N-terminus of SPD contains two cysteines which 
link the homopolymer together by disulfide bonds forming a hub. The trimeric 
collagenous stalk extend from the hub as a cruciate structure and end in a 
spontaneously trimerizing neck region. The amino acid domains in a single chain 
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of the CD40L-SPD are shown at the top. At the bottom is the tetrameric (four 
CD40L trimers) which is expected to form. In addition, the hub region of SPD 
can participate in stacking up to 8 or more cruciate forms into higher order 
aggregates. 

5 

Figure 2. Ion-exchange chromatography of murine CD40L-SPD. CHO cells 
expressing murine CD40L-SPD were grown in serum-free media, concentrated 
using a 100 kDa cutoff ultrafiltration membrane, and diafiltered into 50 mM 
bicine, pH 9.0, 1 mM EDTA. Using an FPLC system, the protein from 400 mL of 
10 media was applied to a Fractogel S0 3 " 650M column and eluted with a linear salt 
gradient. 3 mL samples were collected. Shown are curves for protein 
concentration (OD 2 go), conductivity as % 1 M NaCI in the buffer, and ELIS A- 
detectable CD40L-SPD assayed at 1 : 100 dilution. 



is Figure 3. Size fractionation of murine CD40L-SPD by ultrafiltration. 

CD40L-SPD is a 471 amino acid protein with a predicted molecular weight of 
49,012 for each of the twelve component chains in the dodecamer (composed of 
four trimeric subunits). This does not include added carbohydrates. Therefore, 
the full dodecamer will have a molecular weight in excess of 600,000. However, 

20 from the literature on recombinant surfactant protein D made in CHO cells, it 

appears that some of the product will be in the form of trimers that are not part of 
a cruciate-formed dodecamer. To determine what percentage of CD40L-SPD was 
produced in a multimeric form, supernatant from the transfected CHO cells were 
passed through filters of different porosities (rated for their ability to retard 

25 globular proteins). An ELIS A was used to detect the amount of CD40L-SPD 
(measured at multiple dilutions) that passed through the filter. As shown, about 
90% of the protein is retained by a 300,000 kDa cut-off filter. This indicates that 
most of the protein is in the dodecameric form. In addition, the cruciate 
dodecamers of surfactant protein D can also stack on top of each other into even 
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higher molecular weight forms. This is the likely explanation for the small 
fraction of CD40L-SPD that is retained by the 1 ,000 kDa cutoff filter 

Figure 4. Activation of human B cells by human CD40L-SPD. Conditioned 
5 media from CHO cells expressing human CD40L-SPD was added to human B 
cells along with IL-4. In the left panel, the cells were stained with CyChrome- 
labeled anti-CD19 to identify B cells and PE-labeled anti-CD3 to identify T cells. 
As shown, most of the cells proliferating in the culture were CD1 9+ CD3- B cells. 
In the right panel, the cells were stained with CyChrome-labeled anti-CD 19 to 
io identify B cells and PE-labeled anti-CD80 (B7-1 ) to identify this co-stimulatory 
molecule. As shown, almost all of the B cells were induced by CD40L-SPD to 
express CD80. 

Figure 5. Activation of murine B cells by murine CD40L-SPD. Murine 
15 CD40L-SPD was added to resting murine splenic B cells for a two day culture 
period. For the final 4 hours, the cultures were pulsed with 3 H-thymidine, 
following which the cells were harvested and DNA synthesis was measured by 
scintillation counting. As shown, CD40L-SPD is nearly as effective as anti-IgM 
in promoting the proliferation of resting B cells. 

20 

Figure 6. CD40L-SPD stimulation of macrophage chemokine production. 

Conditioned media from CHO cells expressing human CD40L-SPD, an inactive 
mutant of human CD40L-SPD (T 1 47N-CD40L-SPD), or murine CD40L-SPD 
(mCD40L-SPD) were added to cultures of human monocyte-derived 
25 macrophages. As a negative control, this media was heat-inactivated at 60 °C for 
30 minutes. Also shown is a form of soluble CD40L (sCD40L) consisting of 149 
amino acids from the extracellular domain of human CD40L (Peprotech) added at 
1 Dg/mL. 24 hours later, supernatants were collected and assay for MIP-1 □ by 
ELIS A (R & D Systems). The weak activity of soluble single-trimer CD40L 

12 
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(sCD40L) is apparent. In contrast, native human and murine CD40L-SPD 
strongly activated the macrophages to produce MIP-1 □. In contrast, heat- 
inactivated CD40L-SPD was inactive. As expected, the inactive mutant, T147N- 
CD40L-SPD, also failed to stimulate macrophages, demonstrating that the CD40L 
5 portion and not the SPD portion of the protein was responsible for stimulating the 
macrophages. 



Figure 7* Expression of RANKL/TRANCE-SPD production from CHO cells 
detected by ELISA. Antibodies against RANKL/TRANCE were used to 

10 construct an ELISA capable of detecting the RANKL/TRANCE protein. As 
shown, there was no background with the media control. Using a fusion protein 
between CD70 (CD27L or TNFSF7) and SPD, there was also no signal, indicating 
the specificity of the ELISA. However, using CHO cells transfected with an 
expression plasmid for CD70-SPD, immunoreactive secreted protein was clearly 

15 detectable. This demonstrates the generalizability of the method for expressing 
TNFSF members as fusion proteins with collectins such as SPD. 



DESCRIPTION OF THE PREFERRED EMBODIMENT 
1. Definition of Terms 
20 Multimeric: As used herein the term multimeric refers to a multimer of a 

polypeptide that is itself a trimer (i.e., a plurality of trimers). 

Functional Equivalent: Herein refers to a sequence of a peptide or 
polypeptide that has substantial structural similarity and functional similarity to 
another such sequence. 
2 5 Modifications: Herein refers to point changes involving single amino 

acids, wherein the functionality is altered, without appreciably altering the 
primary sequence or primary structure of a peptide or polypeptide. 

Amino Acid: All amino acid residues identified herein are in the natural L- 
configuration. In keeping with standard polypeptide nomenclature, J. Biol. 

13 
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Chem.. 243 :3557-59. (1969), abbreviations for amino acid residues are as shown 
in the following Table of Correspondence: 



5 

TABLE OF CORRESPONDENCE 



10 




SYMBOL 


AMINO ACID 




1 -Letter 


3-Letter 






Y 


Tyr 


L-tyrosine 




G 


Gly 


glycine 


15 


F 


Phe 


L-phenylalanine 




M 


Met 


L-methionine 




A 


Ala 


L-alanine 




S 


Ser 


L-serine 




L 


He 


L-isoleucine 


20 


L 


Leu 


L-leucine 




T 


Thr 


L-threonine 




V 


Val 


L-valine 




P 


Pro 


L-proline 




K 


Lys 


L-lysine 


25 


H 


His 


L-histidine 




Q 


Gin 


L-glutamine 




E 


Glu 


L-glutamic acid 




W 


Trp 


L-tryptophan 




R 


Arg 


L-arginine 


30 


D 


Asp 


L-aspartic acid 




N 


Asn 


L-asparagin 




C 


Cys 


L-cysteine 



It should be noted that all amino acid residue sequences are represented herein by 
35 formulae whose left to right orientation is in the conventional direction of amino- 
terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a bond to a radical 
such as H and OH (hydrogen and hydroxyl) at the amino- and carboxy-termini, 
respectively, or a further sequence of one or more amino acid residues up to a 

14 
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total of about fifty residues in the polypeptide chain. 

Base Pair (bp): A partnership of adenine (A) with thymine (T), or of 
cytosine (C) with guanine (G) in a double stranded DNA molecule. 

Constitutive promoter: A promoter where the rate of RNA polymerase 
5 binding and initiation is approximately constant and relatively independent of 
external stimuli. Examples of constitutive promoters include the cauliflower 
mosaic virus 35S and 19S promoters described by Poszkowski et al., EMBOJ., 
3:2719 (1989) and Odell et al., Nature, 3 13:810 (1985). 

DNA: Desoxyribonucleic acid. 
10 Enzyme: A protein, polypeptide, peptide RNA molecule, or multimeric 

protein capable of accelerating or producing by catalytic action some change in a 
substrate for which it is often specific. 

Expression vector: A DNA sequence that forms control elements that 
regulate expression of structural genes when operatively linked to those genes. 
15 Expression: The combination of intracellular processes, including 

transcription and translation undergone by a structural gene to produce a 
polypeptide. 

Insert: A DNA sequence foreign to the rDNA, consisting of a structural 
gene and optionally additional DNA sequences. 

20 Nucleotide: A monomeric unit of DNA or RNA consisting of a sugar 

moiety (pentose), a phosphate, and a nitrogenous heterocyclic base. The base is 
linked to the sugar moiety via the glycosidic carbon (V carbon of the pentose) and 
that combination of base and sugar is a nucleoside. When the nucleoside contains 
a phosphate group bonded to the 3' or 5' position of the pentose it is referred to as 

25 a nucleotide. 

Operatively linked or inserted: A structural gene is covalently bonded in 
correct reading frame to another DNA (or RNA as appropriate) segment, such as 
to an expression vector so that the structural gene is under the control of the 
expression vector. 

15 
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Polypeptide and peptide: A linear series of amino acid residues 
connected one to the other by peptide bonds between the alpha-amino and 
carboxy groups of adjacent residues. 

Promoter: A recognition site on a DNA sequence or group of DNA 
sequences that provide an expression control element for a gene and to which 
RNA polymerase specifically binds and initiates RNA synthesis (transcription) of 
that gene. 

Inducible promoter: A promoter where the rate of RNA polymerase 
binding and initiation is modulated by external stimuli. Such stimuli include light, 
heat, anaerobic stress, alteration in nutrient conditions, presence or absence of a 
metabolite, presence of a ligand, microbial attack, wounding and the like. 

Spatially regulated promoter: A promoter where the rate of RNA 
polymerase binding and initiation is modulated in a specific structure of the 
organism such as the leaf, stem or root. Examples of spatially regulated 
promoters are given in Chua et al., Science, 244:174-181 (1989). 

Spatiotemporally regulated promoter: A promoter where the rate of 
RNA polymerase binding and initiation is modulated in a specific structure of the 
organism at a specific time during development. A typical spatiotemporally 
regulated promoter is the EPSP synthase-35S promoter described by Chua et al., 
Science, 244:174-181 (1989). 

Temporally regulated promoter: A promoter where the rate of RNA 
polymerase binding and initiation is modulated at a specific time during 
development. Examples of temporally regulated promoters are given in Chua et 
al., Science, 244: 1 74- 1 8 1 ( 1 989). 

Protein: A linear series of greater than about 50 amino acid residues 
connected one to the other as in a polypeptide. 

Recombinant DNA molecule: A hybrid DNA sequence comprising at 
least two nucleotide sequences not normally found together in nature. 

RNA: Ribonucleic acid. 
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Selective Genetic marker: A DNA sequence coding for a phenotypical 
trait by means of which transformed cells can be selected from untransformed 
cells. 

Structural gene: A DNA sequence that is expressed as a polypeptide, i.e., 
5 an amino acid residue sequence. 

Synthetic promoter: A promoter that was chemically synthesized rather 
than biologically derived. Usually synthetic promoters incorporate sequence 
changes that optimize the efficiency of RNA polymerase initiation. 

10 2. Introduction 

This invention discloses the production of TNFSF proteins as multimeric 
(i.e., many trimers) ligands fused onto a trimeric, branched protein backbone. 
Collectin molecules are ideal for this purpose because they are formed from many 
trimeric, collagenous arms linked to a central hub by disulfide bonds. Of the 

is collectins, pulmonary surfactant protein D (SPD) was chosen initially because it is 
a homopolymer encoded by a single gene, unlike Clq and surfactant protein A, 
which are composed of two different protein subunits. In addition, recombinant 
SPD has been successfully expressed in vitro in reasonable yield [Crouch, 1994], 
and a peptide containing the "neck" region of SPD was shown to spontaneously 

20 trimerize in solution [Hoppe, 1994]. Consequently, extracellular domains of 
human and murine CD40L were substituted for the carbohydrate recognition 
domain of pulmonary surfactant D (SPD) to create a four-armed molecule (three 
peptide chains per arm) with CD40L at the end of each arm. This molecule is 
named CD40L-SPD. In addition, because SPD tends to stack into higher order 

25 aggregates with up to 8 molecules associated at the hub [Crouch], even greater 
degree of multimerization can occur [Lu, 1993]. CD40L-SPD therefore mimics 
the expression of CD40L by an activated T cell in that it presents a multivalent 
complex similar to membrane-bound CD40L. While remaining soluble, CD40L- 
SPD equals membrane CD40L in its range of activities. 

17 
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3. Construction of expression plasmids for CD40L-SPD . 

cDNAs of exposed human and murine CD40L, removed from cell 
membranes, were cloned by PCR by well-known methods. Murine surfactant 
protein D was cloned by hemi-nested PCR from murine lung mRNA (Clonetech). 
5 cDNA was prepared using Superscript II reverse transcriptase (Life Technologies, 
Gaithersburg, MD) and random hexamers as primers. PCR primer sequences 
were as follows (the underlined bases indicate restriction endonuclease sites for 
cloning into the vector): 



10 mSPD5: 5'-CTGACATGCTGCCCTrrCTCTCCATGC-3' 

mSPD3ext: 5'-GGAGGCCAGCTGTCCTCCAGCCTGTTTGC-3' 

rmSPD5: S'GG GG^CTAGC GAATTCCACCAGGAAGCAATCTO A C ATnCTCjCrrTTTCTCT 

CCATGC-3' 

CD40L/SPD3: 5 -CTATCTTGTCC AACCTTCT ATG/GCC ATC AGGGAAC AATGC AGCTTTC-3 * 

15 SFD/CD40L5: 5-AAAGCTGCATTGTTCCCTGATGGC/CATAGAAGGTTGGACA AGAT AG AAG-3 ' 

CD40L3: S'-GGGCTCG AGGTACC AGTTCTAC ATGCCTTGOAGTGTAT A AT-V 

SPD/mCD40L5: S'-GAAAGCTGCATTGTTCCCTGATGGC/CATAGAAGATTGGATAAGGTCGAAG-S' 

mCD40L/SPD3 : 5 -CTTCG ACCTT ATC C AATCTTCT ATG/GCC ATC AGGG AAC AATGC AGCTTTC -3 ' 

mCD40L3 : 5-GGGGGGTACCCTGCTGC AGCCT AGGAC AGCGC AC-3 ' 

20 Because the murine SPD sequence of the 5' untranslated region containing 

the ribosomal binding site was unknown when this work was started [Motwani, 
1 995], a primer (rmSPDS) was designed based on the available rat sequence 
[Shimizu, 1992] which extended the 5* end with rat sequence (shown in bold) 
along with an added Nhe I site (underlined). 



25 



4. Creation of the CD40L-SPD Fusions . 

To create the CD40L-SPD fusions, overlap PCR was used. Murine SPD 
was amplified by nested PCR using mSPD5 and mSPD3ext for the first round of 
30 cycles. The product was diluted 1 : 1 ,000 and 1 DL was amplified for another 

18 
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30 cycles using rmSPD5 and CD40L/SPD3, where the 3 1 half of CD40L/SPD3 is 
a reverse primer for SPD C-terminal to the neck region (deleting the CRD) and 
the 5' half of CD40L/SPD3 contains bases from the N-terminus of the 
extracellular portion of CD40L (immediately adjacent to the transmembrane 
5 region). Similarly, the CD40L plasmid was amplified with SPD/CD40L5 and 
CD40L3, which contains a Kpn I site (underlined). All of these PCRs were 
performed with Pfu cloned polymerase (Stratagene,) using hot start (Ampliwax, 
Perkin-Elmer) and the thermocycling program: 94 °C for 2.5 min; then 30 cycles 
of 94 °C for 10 sec, 43 °C for 30 sec, and 75 °C for 7 min. 

10 

To form the chimeric construct, 1 fxL of a 1 : 1 ,000 dilution of gel-purified 
products from the above reactions was combined and amplified with rmSPD5 and 
CD40L3. Because Pfu polymerase did not consistently yield the expected 1.62 kb 
overlap product, AccuTaq LA DNA polymerase (Sigma) was used for this PCR, 

15 using the thermocycling program: 94 °C for 2.5 min; then 30 cycles of 98 °C for 
20 sec, 43 °C for 30 sec, and 68 °C for 10 min. The resulting product was 
digested with Nhe I and Kpn I, gel-purified, and ligated into the Nhe I and Kpn I 
sites in the expression plasmid, pcDNA3. 1(+) (Invitrogen, Carlsbad, C A). DH5 
E. coli were transformed with the construct and plasmid DNA was purified either 

20 by double banding in ethidium bromide-CsCl gradients or by anion exchange 
resin (QIAgen). To form the T 1 47N-CD40L-SPD construct, the same approach 
was used except that the CD40L coding region was taken from the expression 
plasmid for T147N-CD40L [Kombluth]. The amino acid sequence at the junction 
between SPD and CD40L is . . KAALFPDG/HRRLDKIE. . ., where the C-terminal 

25 portion begins the sequence for CD40L. To form mCD40L-SPD, a similar 
approach was taken except that primers SPD/mCD40L5, mCD40L/SPD3, and 
mCD40L3 were used for amplifications involving murine CD40L sequences. The 
amino acid sequence at the junction between SPD and murine CD40L is 
...KAALFPDG/HRRLDKVE..., where the C-terminal portion begins the sequence 

19 



BNSDOCID: <WO 0142298A1 I > 



WO 01/42298 PCTYUSOO/07380 

for murine CD40L. Both DNA strands of each construct were sequenced to 
confirm that the constructs were correct. In other experiments, an entirely 
humanized construct, consisting of human CD40L fused to human SPD, was 
constructed (data not shown). 

5 

5. Construction of expression plasmid for murine RANKL/TRANCE 
(TNFSF 11). 

Spleen cells from C3H/HeJ mice were stimulated with 5 Dg/ml 
concanavalin A and 10 ng/ml IL-2 (Sigma) for 8 hours (31). mRNA was isolated 
10 using the Micro FastTrack kit (Invitrogen). cDNA was prepared using 

Superscript II reverse transcriptase (Life Technologies) and random hexamers as 
primers. PCR primer sequences were as follows (where the underlined bases 
indicate restriction endonuclease sites for cloning into the vector): 

5mRANKL-ext 5'-CATGTTCCTGGCCCTCCTC-3' 

15 3mRANKL-ext: 5 , -GTACAGGCTCAAGAGAGAGGGC-3 t 

5mRANKL-int: 5- AT ACTCGAGCGC AGATGGATCCT AAC-3 * 

3mRANKL~int: 5-GGGGTTT AGCGSCCGCT AATGTTCC ACGAAATGAGTTC-3 * 

The extracellular portion of RANKL/TRANCE was cloned by nested PCR. 

20 In the first round of PCR, 5 mRANKL-ext and 3mRANKL-ext were used with Pfu 
cloned polymerase (Stragene) using the thermocycling program: 94 °C for 2.5 
min; then 30 cycles of 94 °C for 10 sec, 50 °C for 30 sec, and 75 °C for 2 min. 
The product was diluted 1 : 1 ,000 and 1 DL was amplified for another 30 cycles 
using 5mRANKL-int and 3mRANK-int, which contain an Xho I site and a Not I 

25 site respectively. The resulting product was digested with Xho I, blunt-ended 
with T4 DNA polymerase, then digested with Not I and gel-purified. The 
CD40L-SPD expression plasmid described above was digested with Msc I and 
Not I and gel purified. Then the RANKL/TRANCE sequence was ligated into 
this vector in frame with the SPD coding sequence. The amino acid sequence at 

20 
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the junction between SPD and RANKL/TRANCE is 

...KAALFPDG/RAQMDPNR..., where the N-terminal portion is from SPD and 
the C-terminal portion is the extracellular sequence of RANKL/TRANCE. Both 
DNA strands of each construct were sequenced to confirm that the constructs 
were correct. 

6. Stable transfection of DHFR-deficient CHO cells and amplification , 

DG44 (a line of CHO-K1 cells deficient in dihydrofolate reductase (DHFR)) (32) 
and pCHIP (a plasmid containing the hamster DHFR minigene) (33) were gifts from Dr. 
Lawrence Chasin, Columbia University, New York, NY . DG44 cells were cultured in 
□-MEM consisting of ribo- and deoxynucleoside-free D-MEM (BioWhittaker, 
Walkersville, MD) supplemented with 200 DM L-glutamine, 10% fetal bovine serum 
(FBS) and 10 Dg/ml each of adenosine, deoxyadenosine, and thymidine (Sigma). All 
cell cultures described were negative in a mycoplasma rRNA assay (Gen-Probe, San 
Diego). DG44 cells in six-well plates were transfected by the method of Okayama and 
Chen ((34) with 10 Dg of expression plasmid and 0.05 Dg of pCHIP (200:1 ratio). 
After two days, the transfected DG44 were trypsinized and transferred to 100 mm plates. 
At this point, the media was switched to □ VMEM which differs from D-MEM in that 
dialyzed FBS (HyClone Systems, Logan, UT) was used and no nucleoside supplements 
were added. Only cells containing the DHFR minigene were able to grow in D -MEM, 
and colonies were selected after 10 days, cloned using cloning rings, and transferred to 
12.5 cm 2 flasks. Clones were selected for expansion using an ELISA to screen for the 
production of either murine or human CD40L (see below). Using the method described 
by Kingston et al. (35), escalating doses of methotrexate were used to amplify the 
transfected genes over a period of 6-14 months until the cells grew well in 80 DM 
methotrexate. Each expressing clone was re-cloned once or twice more in order to select 
the highest expressing cells. 

7. Preparation of human and murine CD40L-SPD in serum-free media. 

Selected clones were adapted for growth in nucleoside-free UltraCHO 

media (BioWhittaker) supplemented with 50-100 ng/mL ascorbic acid and 50 ^iM 
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methotrexate (Sigma). The non-adherent population was further adapted for 
suspension growth in roller bottles. In some experiments, the cells were adapted 
from □ -MEM to CHO-S-SFM II media (Life Technologies) supplemented with 
ascorbic acid and 50 ^lg/mL L-proline. 

5 

8. ELISA assay for human and murine CD40L-SPD . 

To assay for correctly folded CD40L, wells of a MaxiSorb 96-well plate 
(Nunc) were coated overnight at 4 °C with 50 DL of carbonate-bicarbonate, pH 
9.40 buffer containing 0.5 jig/mL 24-31 anti-human CD40L MAb (Ancell) or 

10 MR1 anti-murine MAb (Bioexpress, Lebanon, NH). Wells were blocked with 3% 
bovine serum albumin (BSA) in PBS. 100 [ih samples were added to the wells 
either neat or diluted in a dilution buffer consisting of 1% BSA, 0.9 % NaCl, 50 
mM Tris pH 7.40, and 0. 1% peroxide-free Tween 20 (Sigma). After shaking for 2 
h at 600 RPM, a plate washer was used to wash the plate four times with 0.9 % 

15 NaCl, 50 mM Tris pH 7.40, and 0. 1 % peroxide-free Tween 20. Then, 100 \iL of 
diluent buffer containing 1 |ig/mL biotinylated 24-3 1 anti-human CD40L Mab 
(Ancell) or MR1 anti-murine CD40L Mab (Pharmingen, San Diego, CA) was 
added to each well and again shaken for 2 h. Following another four washes, 100 
[ih of diluent buffer containing 1 |ag/mL of streptavid in-alkaline phosphatase 

20 (Jackson) was added to each well and the plate was shaken for 1 hour. Lastly, 
after another four washes, color was developed for 10-20 min using 100 fiL/well 
of BluePhos (Kierkegaard & Perry), stop solution was added, and the wells were 
read at 650 |im in a plate reader. 

25 9. Purification of human and murine CD40L-SPD. 

Conditioned UltraCHO media was filtered using a 0.2 \i PES filter unit 
(Nalgene) and stored at 4 °C for up to 3 months. A preliminary size fractionation 
was performed by ultrafiltration through a 100 kDa-cutoff 76 mm membrane 
(YM-100, Millipore) in a 400 mL stirred cell at 10 lbs/sq. inch pressure of argon. 
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Media was concentrated to about 10 mL, diluted to 100 mL with buffer, and again 
concentrated to 10 mL for a total of 3 cycles of ultrafiltration and buffer 
exchange. Buffer was 50 mM Bicine (Calbiochem), adjusted to pH 9.0 with 
NaOH (about 32 mM Na), and 1 mM EDTA to prevent the activity of any 

5 metalloproteinase. Using FPLC equipment (Amersham-Pharmacia), the 

concentrate was filtered through a 0.45 filter, placed into a 10 mL superloop, 
applied to a 10 X 30 mm column (HR10/30, Amersham-Pharmacia) packed with 
Fractogel SO3" 650M (EM Biosciences), and eluted at 0.5 mL/min at 4 °C with a 
linear gradient of 0-500 mM NaCl in buffer. As described by the manufacturer, 

10 the resolution of proteins on Fractogel SO3" is enhanced by using a long, thin 
column geometry. Fractions were collected and screened for human or murine 
CD40L by ELISA. Positive fractions were pooled, concentrated by ultrafiltration 
(CentriPrep-30, Millipore), filtered through a 0.45 \x filter, and applied to a 
Superose 6 column (Amersham-Pharmacia) in phosphate-buffered saline. 

15 

10. Murine B cell cultures. 

C3H/HeJ mice were euthanized by C0 2 inhalation under a protocol 
approved by the Animal Subjects Committee of the San Diego VA Healthcare 
System. Splenocytes were isolated by centrifugation over Lympholyte-M 

20 (Accurate Chemical & Scientific Corp.. Westburv. NY ) and B cells were isolated 
by negative selection using anti-CD43 immunomagnetic beads (Miltenyi Biotec 
Inc.. Auburn. CA V The resting B cells were suspended in Dulbecco's MEM with 
10% FBS at a concentration of 1 X 10 6 /mL, and 100 jiL was added to the wells of 
96-well flat-bottomed plates. 100 jxL of dilutions of murine CD40L-SPD in 

25 media or media alone were added to the wells, which were incubated in 8.5% C0 2 
at 37 °C for 48 hours. Then, 0.5 fiCi/well of 3 H-thymidine was added to each 
well, and the cells were collected 4 h later onto glass fiber filters using an 
automated cell harvester. A scintillation counter was used to determine the 
incorporated radioactivity. 
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1 1. Human B cell cultures. 

Venous blood from consenting subjects was used as a source of human B 
cells under a protocol approved by the UCSD Institutional Review Board. Blood 
was collected into syringes containing 5 U/mL heparin and peripheral blood 
mononuclear cells (PBMC) were isolated by centrifiigation over Ficoll-hypaque. 
The cells were suspended at 2 X lOVmL in RPMI 1640 containing 200 yM L- 
glutamine, 10% FBS, 0.832 ^iM cyclosporin A (Sigma), and 25 ng/mL human IL- 
4 (R & D Systems) and incubated in 5% C0 2 at 37 °C as described by Schultze et 
ah (36). At intervals, the cells were stained with CyChrome-conjugated anti- 
CD^ and PE-conjugated anti-CD80XB7-l) monoclonal antibodies (Pharmingen) 
and analyzed by flow cytometry. 

12. Human monocvte-derived macrophage and dendritic cell cultures . 

As previously described [Kombluth], monocytes were isolated from 
PBMC by adherence to fibronectin-coated plates, plated into 48-well plates, and 
then cultured in RPMI 1640 containing 200 DM L-glutamine and 10% autologous 
serum for 7-10 days. Monolayers of the matured cells (about 2 X 10 5 /well), 
termed monocyte-derived macrophages or MDM, were then washed in media and 
cultured in 1 mL/well RPMI 1640 containing 200 DM L-glutamine and 10% heat- 
inactivated FBS. Alternatively, dendritic cells (DC) were formed from 
monocytes by adding GM-CSF and IL-4 to the culture media, and the resulting 
DC were used 6 days later. Preparations of CD40L-SPD were added to the wells 
as indicated. As a positive control, 100 ng/mL bacterial lipopolysaccharide (LPS) 
from E. coli 0111 :B4 (Calbiochem) was added. Supernatants were collected 24 h 
later and analyzed for cytokine content using ELISA (R&D Systems). 

Example 1. 

Design principles in constructing colIectin-TNFSF member fusion proteins. 

To express CD40L and other TNFSF members as stable, multimeric 
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proteins, the coding region of the extracellular, C-terminal portion of CD40L was 
joined in-frame to the collectin, surfactant protein D (SPD). The N-terminus of 
SPD contains two cysteines which form the disulfide bonds necessary for the 4- 
armed cruciate structure of the overall molecule [Brown-Augsburger, 1996 #506]. 
5 C-terminal to these cysteines in SPD is a long triple-helical collagenous "stalk" 
which ends in the "neck" region that promotes the trimerization of each arm of the 
structure. Immediately after this neck region, the coding sequence for the 
extracellular portion of CD40L was added, in place of the carbohydrate 
recognition domain (CRD) of SPD. The collectins were chosen as the framework 

10 for the multimeric construct because of their multi-subunit structure and the 

trimeric nature of their stalk regions. Appropriateness of replacing the CRD of a 
collectin with the extracellular region of a TNFSF member is further supported by 
structural studies of the two protein families. An analysis of the CRD crystal 
structure of another collectin, ACRP30, indicated that it was structurally 

15 superimposable upon the crystal structures of the extracellular regions of CD40L, 
TNF, and Fas [Shapiro, 1998], The successful expression of the collectin-TNFSF 
fusion protein, CD40L-SPD, indicates that other TNFSF members (Table I) could 
be conjoined to SPD in a similar manner and that other collectins besides SPD 
(Table II) could be used as a protein framework instead of SPD. Because these 

20 molecules are formed entirely from naturally occurring proteins, the production of 
an immune response (e.g., antibodies) to these fusion proteins is minimized. By 
deleting portions of the stalk region of the TNFSF proteins, additional constructs 
can be made which may be even less immunogenic. 

25 Example 2. 

Expression of human and murine CD40L-SPD in CHO cells. 

The coding regions for the extracellular portion of human CD40L, human 
T147N-CD40L ,an inactive mutant of CD40L, or murine CD40L were joined to 
the neck region of murine SPD, replacing the SPD CRD (Fig. 1). A CMV-driven 
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expression plasmid for the construct was co-transfected with a DHFR minigene 
into DNFR-deficient CHO cells. Following selection in nucleoside-free media, 
expressing CHO clones were amplified by culture in ascending doses of 
methotrexate. The resulting clones produced about 1-10 |ig/mL of the fusion 
protein over a 3 day period in media containing FBS. 

Clones were adapted for growth as suspension cells in two types of serum- 
free media. Murine CHO-SPD produced in UltraCHO (BioWhittaker) was 
largely retained (about 60% as determined by ELISA) by a 1,000 kDa cutoff 
ultrafiltration membrane (Pall Corp., Port Washington, NY), consistent with a 
large multimeric complex formed by the stacking of the SPD portion of the 
molecule. However, in CHO-S-SFM n (Life Technologies), nearly all ELISA- 
detectable murine CHO-SPD passed through a 1 00 kDa cutoff ultrafiltration 
membrane (Millipore), suggesting that the protein was either folding incorrectly 
in this media or was being degraded by proteolysis. Consequently, the 
purification method was optimized for the spent UltraCHO media. 

Example 3. 

Purification of human and murine CD40L-SPD 

Purification procedures were developed for murine CD40L-SPD, but the 
same methods could be applied to human CD40L-SPD with minor modifications. 
Murine CD40L-SPD has a predicted m. w. of 49 kDa per chain, or about 600 kDa 
per 12-chain, cruciate molecule, the amino acid sequence predicts a pi of 9.10. 
Accordingly, conditioned media was concentrated by ultrafiltration through a 100 
kDa cutoff filter, which also fractionates the sample on a size basis. After 
diafiltration into 50 mM bicine, pH 9.00 (also containing 1 mM EDTA added to 
inhibit metalloproteinases), the sample was applied to a variety of cationic 
exchange resins. Using Source 30S (Amersham-Pharmacia), most of the ELISA- 
detectable protein did not bind and was recovered in the flow-through. However, 
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as reported by Morris et al. {Morris}, Fractogel S0 3 650M retained the protein. 
The retention by this tentacular resin and not by Source 30S suggests binding to 
positively charged residues that are not on the protein surface. Using a linear 
NaCl gradient, ELISA-detectable protein elutes at between 0.15-0.30 M NaCl 
5 under these conditions (Fig. 2). In selected experiments, the protein was further 
purified using a Superose 6 sizing column. Most of the ELISA-detectable protein 
eluted in the excluded volume, indicating an apparent m.w. of greater than 1,000 
kDa (Fig. 3). 

10 Example 4. 

Activity of CD40L-SPD on human B cells. 

Schultze et al described a system using CD40L-expressing cells plus IL-4 
and cyclosporin A (to inhibit T cell growth) as a means to grow very large 
numbers of B cells from a small sample of blood. Because CD40L activates these 

15 B cells to express high levels of B7 molecules (CD80 and CD86), the proliferating 
B cells were effective in presenting peptide antigens and rival non-dividing 
dendritic cells as antigen-presenting cells (APCs) (36). To determine if the 
CD40L-SPD fusion protein could replace CD40L-expressing cells in this system, 
PBMC were cultured with CD40L-SPD in addition to IL-4 and cyclosporin A. 

20 Under these conditions the cells grew to saturation density every three days. After 
three weeks, the cultures were almost entirely CD 19+ B cells which express high 
levels of CD80 (Fig. 4). This indicates that CD40L-SPD can be used in ex vivo 
systems where a soluble yet effective form of CD40L is needed to stimulate cells 
for immunotherapeutic applications. 

25 

Example 5. 

Activity of CD40L-SPD on murine B cells. 

Resting murine B cells are particularly difficult to stimulate with most 
soluble forms of CD40L. Even with murine CD40L-CD8 fusion proteins, it is 
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necessary to crosslink the protein with antibodies against CD8 in order to achieve 
maximal proliferation in culture [Klauss, 1999]. Accordingly, resting murine B 
cells were negatively selected with immunomagnetic beads. As shown in Fig. 5, 
murine CD40L-SPD was as effective as anti-IgM antibody in driving B cells to 
5 proliferate. This indicates that CD40L-SPD can mimic the multivalent 

interactions that occur when a responding cell comes in contact with CD40L- 
bearing activating cells. 



Example 6. 

10 Activity of CD40L-SPD on human macrophages and dendritic cells. 

CD40L is a powerful stimulant for macrophages (reviewed in (28)) and 
dendritic cells (40). Accordingly, preparations of CD40L-SPD were added to 
monocyte-derived macrophages and the production of MIP-1 □ was used as a 
measure of stimulation. As shown in Fig. 6, both human and murine CD40L-SPD 
15 were able to stimulate macrophages, whereas the T147N-CD40L-SPD mutant was 
inactive as expected. 

DISCUSSION 

These examples define a new method of producing multimeric (i.e., many 
20 trimers) of CD40L as a fusion protein with SPD. Also prepared and expressed 
were similar fusion proteins between murine RANKL/TRANCE (TNFSF1 1 ) or 
murine CD27L/CD70 (TNFSF7) joined to murine SPD (data not shown). This 
suggests that virtually all TNFSF members could be successfully produced as 
fusion proteins with SPD. Furthermore, it is also likely that other collectins 
25 besides SPD could be used in these fusions, given the strong structural 

homologies between the CRDs of the collectins and the extracellular domains of 
TNFSF members [Shapiro] which can be substituted for these CRDs. Given the 
17 known TNFSF members and 9 known collectins, at least 153 fusion protein 
combinations are possible. 
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SPD was selected for initially because it is a soluble homopolymer. Other 
collectins, such as surfactant protein A, have strong binding affinities to lipids and 
specific cell receptors. Although removal of the CRD abrogates much of this 
binding, it may be partially mediated by the neck region sequence, which the 

5 fusion proteins retain. Accordingly, it would be expected that collectins other 
than SPD might confer different cell-binding and pharmacokinetic behaviors upon 
a fusion protein. For example, macrophages are known to take up and degrade 
whole SPD [Dong, 1998]. If a fusion protein other than SPD were used, the 
disposition of the fusion protein in vivo might be altered. Additionally, 

10 metalloproteinases are known to degrade the collectin, Clq , so that a fusion with 
Clq may alter the degradation of the fusion protein. For example, because 
CD40L activates macrophages and other cells to produce metalloproteinases, 
which could potentially degrade the collagenous portion of SPD and other 
collectins. Cleavage of the collagenous stalk would then be expected to release 

15 single-trimers of CD40L, which could diffuse away from the original parent 

molecule, much like a slow-release formulation of a drug. Also, the membrane- 
proximal portion of CD40L has been retained in CD40L-SPD. This sequence also 
contains protease-susceptible amino acid sequences, which can be eliminated by 
mutagenesis to retard the cleavage of CD40L from the fusion protein. Mutations 

20 in such proteinase cleavage site(s) would delay such cleavage and favor the local 
persistence of the CD40L stimulus. 

CD40L-SPD is a large macromolecule (> 1,000 kDa), and the other 
TNFSF-collectin fusion proteins would be expected to be similarly large. For 
25 native SPD, the aggregates that spontaneously form measure 1 00 nm in diameter. 
When injected into tissue, this large a complex would be expected to remain at the 
injection site for a prolonged period. Localization of the TNFSF-containing 
protein would also be expected to reduce any systemic toxicity caused by the 
release of free single-trimers into the circulation. For example, soluble CD40L in 

29 



BNSDOCID: <WO 0142298A1_I_> 




WO 01/42298 PCT/US00/07380 

blood has been linked to disease activity in lupus, and this smaller molecule may 
even cross the glomerulus to cause damage to renal tubules [Kato and Kipps, J. 
Clin. Invest. Nov. 1999]. On the other hand, because CD40L induces the 
production of chemokines which attract immune cells [Kornbluth], T cells, 

5 monocytes, and dendritic cells would be expected migrate to the site where 

CD40L-SPD was injected. This might be advantageous if CD40L-SPD were used 
as a vaccine adjuvant. In mice, soluble CD40L (sCD40LT) stimulates IgGl 
production but not cytotoxic T lymphocytes (CTLs) [Wong, 1999]. Interestingly, 
the same protein that is expressed from an injected plasmid stimulates both a 

10 strong antibody and CTL response [Gurunathan, 1998]. In the latter case, the 
plasmid would be expected to deliver a localized supply of CD40L, whereas the 
sCD40LT protein is free to diffuse away. Support for the localized use of CD40L 
in an adjuvant formulation is provided by a study using a plasmid expressing full- 
length membrane CD40L, which was very effective in stimulating both humoral 

15 and CTL immune responses [Mendoza, 1997]. Similarly, injection of adenovirus 
expressing membrane CD40L has potent antitumor activity in mice [Kikuchi, 
1999]. Similar considerations would likely apply to other fusion proteins between 
the TNFSF and collectins. 

20 Finally, for immunostimulatory proteins, it is particularly important that 

the protein not be antigenic if repeated injections are needed. For example, 
vaccination with TNF-ji modified by the addition of short peptide sequences was 
able to induce the production of disease-modifying anti-TNF-ji autoantibodies 
[Dalum, 1999]. Because CD40L-SPD and other TNFSF-collectin fusion proteins 

25 are formed from endogenous protein sequences (with the possible exception of the 
peptide sequence at the junction), the production of antibodies might not limit the 
effectiveness of repeated injections. 

In conclusion, fusions between TNFSF members and collectins offer a 
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novel means of generating large protein complexes which can provide localized 
stimulation at an injection site. Because of the multimeric nature of the collectin 
backbone, such fusion proteins may mimic the multivalent ligand surface 
presented by the membrane forms of TNFSF members to TNFRS F-bearing 
5 responding cells. Moreover, by limiting systemic toxicity while maintaining 
localized efficacy, such fusion proteins may have a role as vaccine adjuvants 
against infectious agents and tumors. 



31 



3NSOOC1D: <WO 0142298A1 I > 



WO 01/42298 PCTYUS00/07380 





Table I 






Ligands of the TNF Superfamily* 




New Lieand Symbol 


Other Names 




Genbank ID 






LTA 


Lymphotoxin-, TNF.-D, TNFSF1 


X01393 


TNF 


TNF-d, TNFSF2 


X02910 


LTB 


Lymphotoxin-, TNFSF3 


L11016 


TNFSF4 


OX-40L 


D90224 


TNFSF5 


CD40L, CD154. Gp39, T-BAM 


X67878 


TNFSF6 


FasL 


U11821 


TNFSF7 


CD27L, CD70 


L08096 


TNFSF8 


CD30L 


L09753 


TNFSF9 


4-1BBL 


U03398 


TNFSF10 


TRAIL, Apo-2L 


U37518 


TNFSF1 1 


RANKL, TRANCE, OPGL, ODF 


AF013171 


TNFSF12 


TWEAK, Apo-3L 


AF030099 


TNFSF13 


APRIL 


NM_003808 


TNFSF13B 


BAFF, THANK, BLYS 


AF 136293 


TNFSF14 


LIGHT, HVEM-L 


AF036581 


TNFSF15 


VEGI 


AF039390 


TNFSF16 


unidentified 




TNFSF17 


imidentified 




TNFSF18 


AITRL, GITRL 


AF 125303 



♦(asofNov. 1, 1999) 

Known members of ligands in the TNF superfamily, taken from the Human 

Gene Nomenclature Committee at 

http ://www. gene, ucl . ac . uk/users/hester/tnftop . htm 
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Table II 



The Collectin Superfamily 



Clq 

Mannose-binding 
protein, MBL1 
Mannose-binding 
protein, MBL2 
Pulmonary surfactant 
protein A 



Pulmonary surfactant 

protein D 

conglutinin 

collectin-43 

CL-L1 

ACRP30 

Hib27 



All collectins are formed as multimers of trimeric subunits, each containing a 
collagenous domain. The C-terminus of each collectin contains a CRD which 
binds carbohydrates and other ligands. Because of the tight similarities between 
5 the known CRD structures and the extracellular domains of TNFSF members, it is 
likely that the CRD of any collectin could be replaced with the extracellular 
domain of any TNFSF member in a structurally compatible manner. 



10 preferred embodiments, and exemplified with respect thereto, one skilled in the 
art will readily appreciate that various modifications, changes, omissions and 
substitutions may be made without departing from the spirit thereof. It is 
intended, therefore, that the present invention be limited solely by the scope of the 
following claims. 



While the present invention has now been described in terms of certain 
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CLAIMS 



10 



15 



20 



What is claimed is: 

1 . A hybrid nucleotide sequence of no more than 1 528 base pairs including a 
sequence defining a structural gene expressing a conjoined single strand of a 
multimeric TNFSF-SPD fusion protein, said structural gene having a nucleotide 
base sequence selected from members of the group consisting of SEQ ID NO 1 , 
SEQ ID NO 3 and SEQ ID NO 5. 

2. The DNA segment according to claim 1 , wherein the structural gene 
comprises: 

a segment expressing a single hybrid amino acid chain of TNFSF-SPD, said 
segment having 

a first SPD nucleotide base sequence of SEQ ID NO 1, from base 32 
to base 799, and 

a second sequence, expressing a portion of TNFSF stalk, selected 
from members of the group consisting of SEQ ID NO 1, from base 800 to 
base 1444, SEQ ID NO 3, from base 800 to base 1528, and SEQ ID NO 5, 
from base 800 to base 1441. 

3. A recombinant DNA molecule comprising a vector operatively linked to 
an exogenous DNA segment defining a structural gene expressing a single amino 
acid chain of TNFSF-SPD, said structural gene having 

a nucleotide base sequence selected from members of the group consisting of 
SEQ ID NO 1, SEQ ID NO 3 and SEQ ID NO 5, any functional equivalents and 
modifications thereof, and 

an appropriate promoter for driving the expression of said structural gene in a 
compatible host organism. 



34 



3NSDOCID: <WO. 



0142298A1J_> 



WO 01/42298 PCT/US00/07380 

4. The recombinant DNA molecule as described in claim 3 wherein said host 
organism is E. coli. 

5 5. The recombinant DNA molecule as described in claim 3, wherein said 

host organism is a yeast. 

6. The recombinant DNA molecule as described in claim 3, wherein said 
host organism is a higher plant or animal. 

10 

7. A multimeric TNFSF-SPD fusion protein comprising: 
a plurality of polypeptide trimers, 

i) a first trimer consisting of peptide strands of members of the TNF 
superfamily (TNFSF) of ligands, and 
is ii) a second trimer strand from a collectin molecule 

each first trimer conjoined to a second polypeptide trimer 
strand from a collectin molecule, wherein said ligand strand is 
substituted for native carbohydrate recognition domains (CRD) of the 
collectin molecules, 

20 said conjoined collectin strands covalently bound in parallel 

to each other, therein forming a multimeric fusion protein 
comprising a plurality of trimeric hybrid polypeptide strands 
radiating from a covalently bound center hub of the molecule, the 
free end of each trimeric radiating strand having a TNFSF moiety 

25 attached. 

8. The multimeric fusion protein according to claim 6, wherein the TNFSF 
moiety is one selected from the group consisting of ligands LTA, TNF, LTB, and 
TNFSF4 to TNFSF 1 8 as in Table II, their funtional equivalents, and 
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modifications thereof. 

9. A method for preparing a CD40-SPD multimeric fusion polypeptide, 
comprising the steps of: 

5 initiating a culture, in a nutrient medium, of procaryotic or eucaryotic host 

cells transformed with a recombinant DNA molecule including an expression 
vector, appropriate for said cells, operatively linked to an exogenous DNA 
segment defining a structural gene for CD40-SPD ligand, said structural gene 
having a nucleotide base sequence of SEQ ID NO 1 from about base 32 to about 
10 base 1444; and 

maintaining said culture for a time period sufficient for said cells to express 
said multimeric molecule. 

10. A method of producing a secreted, large, biologically active, multimeric 
15 tumor necrosis factor superfamily ligand fusion protein chimera that is highly 

immunogenic and not readily diffusable, comprising: 

introducing into a host cell a first chimeric DNA construct including 
a transcriptional promoter operatively linked to a first 
secretory signal sequence, followed downstream by, and in proper 
20 reading frame with 

a first DNA sequence encoding a polypeptide chain of 
a first TNFSF ligand requiring multimerization for biological 
activity, joined to 

a second DNA sequence encoding a collectin 
25 polypeptide at the site where the collectin 's CRD was 

purposefully removed, 
introducing into said host cell a second DNA construct including 

a transcriptional promoter operably linked to a second 
secretory signal sequence followed downstream by, and in 
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proper reading frame with, 



a third DNA sequence encoding a second polypeptide 
chain of a second TNFSF ligand , joined to 



a fourth DNA sequence encoding a collectin 
polypeptide , wherein the collectings CRD was purposefully 
removed; 



10 



15 



20 



growing said host cell in an appropriate growth medium under 
physiological conditions to allow the secretion of a large multimerized 
polypeptide fusion protein, wherein 

said first polypeptide chain of a TNFSF-SPD protein is bound by parallel 
bonding of the respective collectin domain trimer to 

said second polypeptide chain of a different TNFSF-SPD polypeptide 
trimer, and wherein said multimerized polypeptide fusion protein exhibits 
biological activity characteristic of both membrane-attached TNFSFs; and 

isolating said biologically active, multimerized TNFSF-SPD polypeptide 
fusion from said host cell. 

11. The method according to claim 10, wherein the chimeric reactant 
compounds are humanized to guard against destruction by a potential human 
recipient's immune system. 

12. A method of preparing a multimeric TNFSF-SPD ligand fusion protein, 
comprising: 

preparing a first DNA segment coding for a strand of an exposed 
extracellular portion of TNFSF; 

preparing a second DNA segment coding for a collectin polypeptide 
strand, wherein said collectings CRD domain of the strand has been 
removed; 



conjoining said first and second DNAs in proper reading frame, therein 
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creating a TNFSF-collectin DNA construct; 

inserting said construct into an expression vector system; 

introducing said vector system into an appropriate cell in culture under 
suitable conditions; 
5 harvesting and purifying spent medium from said culture; and 

assaying for presence of multimeric TNFSF-collectin fusion protein. 

13. A method for stimulating the immune response in potentially 
immonocompetent cells using multimeric TNFSF fusion proteins, comprising: 

io contacting said cells with said multimeric TNFSF fusion proteins, wherein 

said cells are induced to proliferate. 

14. The method according to claim 15, wherein the cells are resting B cells. 

15 15. A method for increasing antigenicity of cells, comprising: 

contacting said cells with said multimeric TNFSF fusion proteins, wherein said 
cells are tumor cells or HIV positive cells. 
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Figure 2 
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Figure 3 
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Figure 5 
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SEQUENCE LISTING 

<110> Kornbluth, Richard S 

<120> Multimeric forms of CD40L and other TNF family members 

<130> TNFSF-collectin fusion proteins 

<140> 60/111,471 
<141> 1998-12-08 

<160> 6 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1552 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> 5'UTR 
<222> (7) . . (31) 

<220> 

<221> misc__feature 
<222> (88)'. . (799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (801) . . (1546) 

<223> Human CD40 ligand extracellular region, including 
stalk. 

<220> 

<221> sig_peptide 
<222> (32).. (88) 

<223> Signal peptide from murine surfactant protein D 
<220> 
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<221> CDS 

<222> (32) . . (1444) 

<220> 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 

<300> 

<301> Spriggs, Melanie K. 

Armitage, Richard J. 

Strockbine, L 

Clifford, K N. 

Macduff, B M. 

Sato, T A. 

Maliszewski, C R. 

Fanslow, William C. 
<302> Recombinant human CD4 0 ligand stimulates B cell 

proliferation and immunoglobulin E secretion. 
<303> J. Exp. Med. 
<304> 176 
<305> 6 

<306> 1543-1550 

<307> 1992 

<313> 801 TO 1600 

<300> 

<301> Motwani, M 
White, R A. 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14. 

<303> J. Immunol. 
<304> 155 
<305> 12 

<306> 5671-5677 ' 
<307> 1995 
<313> 32 TO 800 

<400> 1 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 
52 

Met Leu Pro Phe Leu Ser Met 

2 



BNSDOCID: <WO 0142298A1 1 > 



WO 01/42298 



PCT/US00/07380 



ctt gtc ttg ctt 
100 

Leu Val Leu Leu 
10 

age etc teg cag 
148 

Ser Leu Ser Gin 
25 

age cca aca gag 
196 

Ser Pro Thr Glu 
40 

gaa ggt cca egg 
244 

Glu Gly Pro *Arg 



ggg etc tea ggg 
292 

Gly Leu Ser Gly 
75 

gag aat ggc tct 
340 

Glu Asn Gly Ser 
90 

agt gga cct cca 
388 

Ser Gly Pro Pro 
105 

ggt ccc tct ggg 
436 

Gly Pro Ser Gly 
120 



gta cag ccc ctg 

Val Gin Pro Leu 
15 

aga tea gta ccc 

Arg Ser Val Pro 
30 

aat ggc ctg cct 

Asn Gly Leu Pro 
45 

ggt gag aag ggt 

Gly Glu Lys Gly 
60 

ttg cag ggc cct 
Leu Gin Gly Pro 

get ggc gaa cct 

Ala Gly Glu Pro 
95 

gga ctt cca ggt 

Gly Leu Pro Gly 
110 

aag cag ggg aac 

Lys Gin Gly Asn 
125 



1 

gga aat ctg gga 
Gly Asn Leu Gly 

aac ace tgc acc 

Asn Thr Cys Thr 
35 

ggt cgt . gat gga 

Gly Arg Asp Gly 
50 

gat cca ggt ttg 

Asp Pro Gly Leu 
65 

aca ggt cca gtt 

Thr Gly Pro Val 
80 

gga cca aag gga 
Gly Pro Lys Gly 

att cct ggt cca 

lie Pro Gly Pro 
115 

ata gga cct caa 

lie Gly Pro Gin 
130 



5 

gca gaa atg aag 

Ala Glu Met Lys 
20 

eta gtc atg tgt 

Leu Val Met Cys 

egg gat ggg aga 

Arg Asp Gly Arg 
55 

cca gga cct atg 

Pro Gly Pro Met 
70 

gga ccc aaa gga 

Gly Pro Lys Gly 
85 

gaa cgt gga eta 

Glu Arg Gly Leu 
100 

get ggg aaa gaa 
Ala Gly Lys Glu 

ggc aaa cca ggt 

Gly Lys Pro Gly 
135 
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cct aaa gga gag 
484 

Pro Lys Gly Glu 



caa gga tct aca 
532 

Gin Gly Ser Thr 
155 

ggt gcc cct ggt 
580 

Gly Ala Pro Gly 
170 

cct gcc gga cct 
628 

Pro Ala Gly Pro 
185 

cca gga etc aag 
676 

Pro Gly Leu Lys 
200 

ggt gaa age ggg 
724 

Gly Glu Ser Gly 



gcc tta aaa gga 
772 

Ala Leu Lys Gly 

235 

cag aaa get gca 
820 

Gin Lys Ala Ala 
250 

gaa gat gaa agg 
868 

Glu Asp Glu Arg 



get ggg ccc aaa 

Ala Gly Pro Lys 
140 

ggg gca aaa ggc 

Gly Ala Lys Gly 

gtg caa gga gcc 

Val Gin Gly Ala 
175 

gcc ggt cca cag 

Ala Gly Pro Gin 
190 

ggg gac aga ggt 

Gly Asp Arg Gly 
205 

ctt cca gac agt 

Leu Pro Asp Ser 
220 

aaa eta cag cgt 
Lys Leu Gin Arg 

ttg ttc cct gat 

Leu Phe Pro Asp 
255 

aat ctt cat gaa 
Asn Leu His Glu 



gga gaa gta ggt 

Gly Glu Val Gly 
145 

tec aca ggc ccc 

Ser Thr Gly Pro 
160 

cca ggg aat get 

Pro Gly Asn Ala 

gga get cca ggt 

Gly Ala Pro Gly 
195 

gtt cct gga gac 

Val Pro Gly Asp 
210 

get get ctg agg 

Ala Ala Leu Arg 
225 

eta gag gtt gcc 

Leu Glu Val Ala 
240 

ggc cat aga agg 
Gly His Arg Arg 

gat ttt gta ttc 
Asp Phe Val Phe 



get cct ggc atg 

Ala Pro Gly Met 
150 

aag gga gaa aga 

Lys Gly Glu Arg 
165 

gga gca gca gga 

Gly Ala Ala Gly 
180 

tec agg ggg ccc 

Ser Arg Gly Pro 

aga gga ate aaa 

Arg Gly lie Lys 
215 

cag cag atg gag 

Gin Gin Met Glu 
230 

ttc tec cac tat 

Phe Ser His Tyr 
245 

ttg gac aag ata 

Leu Asp Lys lie 
260 

atg aaa acg ata 
Met Lys Thr lie 
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265 

cag aga tgc aac 
916 

Gin Arg Cys Asn 
280 

gag att aaa age 
964 

Glu lie Lys Ser 



aaa gag gag acg 
1012 

Lys Glu Glu Thr 
315 

cag aat cct caa 
1060 

Gin Asn Pro Gin 
330 

aca aca tct gtg 
1108 

Thr Thr Ser Val 
345 

aac aac ttg gta 
1156 

Asn Asn Leu Val 
360 

caa gga etc tat 
1204 

Gin Gly Leu Tyr 



gaa get teg agt 
1252 

Glu Ala Ser Ser 
395 



270 

aca gga gaa aga 

Thr Gly Glu Arg 
285 

cag ttt gaa ggc 

Gin Phe Glu Gly 
300 

aag aaa gaa aac 
Lys Lys Glu Asn 

att gcg gca cat 

He Ala Ala His 
-335 

tta cag tgg get 

Leu Gin Trp Ala 
350 

acc ctg gaa aat 

Thr Leu Glu Asn 
365 

tat ate tat gec 

Tyr He Tyr Ala 
380 

caa get cca ttt 
Gin Ala Pro Phe 



275 

tec tta tec tta 

Ser Leu Ser Leu 
290 

ttt gtg aag gat 

Phe Val Lys Asp 
305 

age ttt gaa atg 

Ser Phe Glu Met 
320 

gtc ata agt gag 

Val He Ser Glu 

gaa aaa gga tac 

Glu Lys Gly Tyr 
355 

ggg aaa cag ctg 

Gly Lys Gin Leu 
370 

caa gtc acc ttc 

Gin Val Thr Phe 
385 

ata gee age etc 

He Ala Ser Leu 
400 



ctg aac tgt gag 

Leu Asn Cys Glu 
295 

ata atg tta aac 

He Met Leu Asn 
310 

caa aaa ggt gat 

Gin Lys Gly Asp 
325 

gee age agt aaa 

Ala Ser Ser Lys 
34 0 

tac acc atg age 
Tyr Thr Met Ser 

acc gtt aaa aga 

Thr Val Lys Arg 
375 

tgt tec aat egg 

Cys Ser Asn Arg 
390 

tgc eta aag tec 

Cys Leu Lys Ser 
405 
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ccc ggt aga ttc gag aga ate tta 
1300 

Pro Gly Arg Phe Glu Arg He Leu 

410 415 

tec gec aaa cct tgc ggg caa caa 
1348 

Ser Ala Lys Pro Cys Gly Gin Gin 
425 430 

gaa ttg caa cca ggt get teg gtg 
1396 

Glu Leu Gin Pro Gly Ala Ser Val 
440 445 

caa gtg age cat ggc act ggc ttc 
1444 

, Gin Val Ser His Gly Thr Gly Phe 

460 



etc aga get gca aat acc cac agt 

Leu Arg Ala Ala Asn Thr His Ser 

420 

tec att cac ttg gga gga gta ttt 

Ser He His Leu Gly Gly Val Phe 
435 

ttt gtc aat gtg act gat cca age 

Phe Val Asn Val Thr Asp Pro Ser 
450 455 

acg tec ttt ggc tta etc aaa etc 

Thr Ser Phe Gly Leu Leu Lys Leu 
465 470 



tgaacagtgt caccttgcag gctgtggtgg agetgacget gggagtcttc ataatacagc 
1504 

acaggcttaa geccaattat acactccaag gcatgtagaa ctggtacc 
1552 



<210> 2 
<211> 471 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (without the CRD) fused to 
the extracellular portion of human CD40L 



<400> 2 

Met Leu Pro Phe 
1 

Asn Leu Gly Ala 
20 

Thr Cys Thr Leu 



Leu Ser Met Leu 
5 

Glu Met Lys Ser 
Val Met Cys Ser 



Val Leu Leu Val 
10 

Leu Ser Gin Arg 
25 

Pro Thr Glu Asn 



Gin Pro Leu Gly 
15 

Ser Val Pro Asn 
30 

Gly Leu Pro Gly 
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35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 ~ 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 • 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly lie 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn lie 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly lie Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 " 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

His Arg Arg Leu Asp Lys lie Glu Asp Glu Arg Asn Leu His Glu Asp 
260 265 270 
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Phe Val Phe Met Lys Thr lie Gin Arg Cys Asn Thr Gly Glu Arg Ser 
275 280 285 

Leu Ser Leu Leu Asn Cys Glu Glu lie Lys Ser Gin Phe Glu Gly Phe 
290 295 300 

Val Lys Asp lie Met Leu Asn Lys Glu Glu Thr Lys Lys Glu Asn Ser 
305 310 315 320 

Phe Glu Met Gin Lys Gly Asp Gin Asn Pro Gin lie Ala Ala His Val 

325 330 335 

lie Ser Glu Ala Ser Ser Lys Thr Thr Ser Val Leu Gin Trp Ala Glu 
340 345 350 

Lys Gly Tyr Tyr Thr Met Ser Asn Asn Leu Val Thr Leu Glu Asn Gly 
355 360 365 

Lys Gin Leu Thr Val Lys Arg Gin Gly Leu Tyr Tyr lie Tyr Ala Gin 
370 375 380 

Val Thr Phe Cys Ser Asn Arg Glu Ala Ser Ser Gin Ala Pro Phe lie 
385 390 395 400 

Ala Ser Leu Cys Leu Lys Ser Pro Gly Arg Phe Glu Arg lie Leu Leu 

405 410 415 

Arg Ala Ala Asn Thr His Ser Ser Ala Lys Pro Cys Gly Gin Gin Ser 
420 425 " 430 

lie His Leu Gly Gly Val Phe Glu Leu Gin Pro Gly Ala Ser Val Phe 
435 440 445 

Val Asn Val Thr Asp Pro Ser Gin Val Ser His Gly Thr Gly Phe Thr 
450 455 460 

Ser Phe Gly Leu Leu Lys Leu 
465 470 



<210> 3 



8 
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<211> 1574 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> 5 f UTR 
<222> (7) . . (31) 

<223> 5* UTR taken from rat sequence for surfactant 
protein D 

<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1534) 

<220> 

<221> misc^feature 
<222> (32) . . (800) 

<223> Murine surfactant protein D including hub region, 
collagenous portion, and neck, but excluding 
carbohydrate recognition domain ( CRD ) 

<220> 

<221> misc_feature 
<222> (801) . . (1534) 

<223> Murine RANKL /TRANCE extracellular region, 
including stalk 

<300> 

<301> Motwani, M 
White, R A. 
Guo, N 
Dowler, L L. 
Tauber, A I. 
Motwani, M 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14 . 

<303> J. Immunol. 

9 
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<304> 155 
<305> 12 
<306> 5671-5677 
<307> 1995 
<313> 32 TO 800 



<300> 

<301> Anderson, D M. 

Maraskovsky, E 
Billingsley, W L. 
Dougall, W C. 

<302> A homologue of the TNF receptor and its ligand enhance 

T-cell growth and dendritic-cell function. 
<303> Nature 
<304> 390 
<305> 6656 
<306> 175-179 
<307> 1997 
<313> 801 TO 1534 

<400> 3 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 
52 

Met Leu Pro Phe Leu Ser Met 
1 5 



ctt gtc ttg ctt 
100 

Leu Val Leu Leu 
10 

age etc teg cag 
148 

Ser Leu Ser Gin 
25 



gta cag ccc ctg 

Val Gin Pro Leu 
15 

aga tea gta ccc 

Arg Ser Val Pro 
30 



gga aat ctg gga 
Gly Asn Leu Gly 

aac acc tgc ace 

Asn Thr Cys Thr 
35 



gca gaa atg aag 

Ala Glu Met Lys 
20 

eta gtc atg tgt 
Leu Val Met Cys 



age cca aca gag aat ggc ctg cct ggt cgt gat gga egg gat ggg aga 
196 

Ser Pro Thr Glu Asn Gly Leu Pro Gly Arg Asp Gly Arg Asp Gly Arg 
40 45 50 55 



gaa ggt cca egg ggt gag aag ggt gat cca ggt ttg cca gga cct atg 
244 

10 
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Glu Gly Pro Arg 



ggg etc tea ggg 
292 

Gly Leu Ser Gly 
75 

gag aat ggc tct 
340 

Glu Asn Gly Ser 
90 

agt gga cct cca 
388 

Ser Gly Pro Pro 
105 

ggt ccc tct ggg 
436 

Gly Pro Ser Gly 
120 

cct aaa gga gag 
484 

Pro Lys Gly Glu 



caa gga tct aca 
532 

Gin Gly Ser Thr 
155 

ggt gec cct ggt 
580 

Gly Ala Pro Gly 
170 

cct gec gga cct 
628 

Pro Ala Gly Pro 
185 



Gly Glu Lys Gly 
60 

ttg cag ggc cct 

Leu Gin Gly Pro 

get ggc gaa cct 

Ala Gly Glu Pro 
95 

gga ctt cca ggt 

Gly Leu Pro Gly 
110 

aag cag ggg aac 

Lys Gin Gly Asn 
125 

get ggg ccc aaa 

Ala Gly Pro Lys 
140 

ggg gca aaa ggc 
Gly Ala Lys Gly 

gtg caa gga gee 

Val Gin Gly Ala 
175 

gee ggt cca cag 

Ala Gly Pro Gin 
190 



Asp Pro Gly Leu 
65 

aca ggt cca gtt 

Thr Gly Pro Val 
80 

gga cca aag gga 
Gly Pro Lys Gly 

att cct ggt cca 

lie Pro Gly Pro 
115 

ata gga cct caa 

lie Gly Pro Gin 
130 

gga gaa gta ggt 

Gly Glu Val Gly 
145 

tec aca ggc ccc 

Ser Thr Gly Pro 
160 

cca ggg aat get 
Pro Gly Asn Ala 

gga get cca ggt 

Gly Ala Pro Gly 
195 



Pro Gly Pro Met 
70 

gga ccc aaa gga 

Gly Pro Lys Gly 
85 

gaa cgt gga eta 

Glu Arg Gly Leu 
100 

get ggg . aaa gaa 

Ala Gly Lys Glu 

ggc aaa cca ggt 

Gly Lys Pro Gly 
135 

get cct ggc atg 

Ala Pro Gly Met 
150 

aag gga gaa aga 

Lys Gly Glu Arg 
165 

gga gca gca gga 

Gly Ala Ala Gly 
180 

tec agg ggg ccc 
Ser Arg Gly Pro 
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cca gga etc aag 
676 

Pro Gly Leu Lys 
200 

ggt gaa age ggg 
724 

Gly Glu Ser Gly 



gee tta aaa gga 
772 

Ala Leu Lys Gly 

235 

cag aaa get gca 
820 

Gin Lys Ala Ala 
250 

aga ata tea gaa 
868 

Arg lie Ser Glu 
265 

cat gaa aac gca 
916 

His Glu Asn Ala 
280 

eta cct gac tec 
964 

Leu Pro Asp Ser 



cag aag gaa ctg 
1012 

Gin Lys Glu Leu 
315 

cca get atg atg 
1060 

Pro Ala Met Met 
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ggg gac aga ggt 

Gly Asp Arg Gly 
205 

ctt cca gac agt 

Leu Pro Asp Ser 
220 

aaa eta cag cgt 
Lys Leu Gin Arg 

ttg ttc cct gat 

Leu Phe Pro Asp 
255 

gac age act cac 

Asp Ser Thr His 
270 

ggt ttg cag gac 

Gly Leu Gin Asp 
285 

tgc agg agg atg 

Cys Arg Arg Met 
300 

caa cac att gtg 
Gin His lie Val 

gaa ggc tea tgg 
Glu Gly Ser Trp 



gtt cct gga gac 

Val Pro Gly Asp 
210 

get get ctg agg 

Ala Ala Leu Arg 
225 

eta gag gtt gee 

Leu Glu Val Ala 
240 

ggt cga gcg cag 
Gly Arg Ala Gin 

tgc ttt tat aga 

Cys Phe Tyr Arg 
275 

teg act ctg gag 

Ser Thr Leu Glu 
290 

aaa caa gee ttt 

Lys Gin Ala Phe 
305 

ggg cca cag cgc 

Gly Pro Gin Arg 
320 

ttg gat gtg gee 

Leu Asp Val Ala 
12 



aga gga ate aaa 

Arg Gly lie Lys 
215 

cag cag atg gag 

Gin Gin Met Glu 
230 

ttc tec cac tat 

Phe Ser His Tyr 
245 

atg gat cct aac 

Met Asp Pro Asn 
260 

ate ctg aga etc 
lie Leu Arg Leu 

agt gaa gac aca 

Ser Glu Asp Thr 
295 

cag ggg gee gtg 

Gin Gly Ala Val 
310 

ttc tea gga get 

Phe Ser Gly Ala 
325 

cag cga ggc aag 
Gin Arg Gly Lys 
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330 

cct gag gcc cag 
1108 

Pro Glu Ala Gin 
345 

cca teg ggt tec 
1156 

Pro Ser Gly Ser 
360 

ggc tgg gcc aag 
1204 

Gly Trp Ala Lys 



gtt aac caa gat 
1252 

Val Asn Gin Asp 
395 

cat cat gaa aca 
1300 

His His Glu Thr 
410 

gtg tat gtc gtt 
1348 

Val Tyr Val Val 
425 

atg aaa gga ggg 
1396 

Met Lys Gly Gly 
440 

ttt tat tec ata 
1444 

Phe Tyr Ser lie 



335 

cca ttt gca cac 

Pro Phe Ala His 
350 

cat aaa gtc act 

His Lys Val Thr 
365 

ate tct aac atg 

lie Ser Asn Met 
380 

ggc ttc tat tac 
Gly Phe Tyr Tyr 

teg gga age gta 

Ser Gly Ser Val 
415 

aaa acc age ate 

Lys Thr Ser lie 
430 

age acg aaa aac 

Ser Thr Lys Asn 
445 

aat gtt ggg gga 

Asn Val Gly Gly 
460 



etc acc ate aat 

Leu Thr lie Asn 
355 

ctg tec tct tgg 

Leu Ser Ser Trp 
370 

acg tta age aac 

Thr Leu Ser Asn 
385 

ctg tac gcc aac 

Leu Tyr Ala Asn 
400 

cct aca gac tat 
Pro Thr Asp Tyr 

aaa ate cca agt 

Lys lie Pro Ser 
4 35 

tgg teg ggc aat 

Trp Ser Gly Asn 
450 

ttt ttc aag etc 

Phe Phe Lys Leu 
4 65 

13 



34 0 

get gcc age ate 
Ala Ala Ser lie 

tac cac gat cga 

Tyr His Asp Arg 
375 

gga aaa eta agg 

Gly Lys Leu Arg 
390 

att tgc ttt egg 

lie Cys Phe Arg 
405 

ctt cag ctg atg 

Leu Gin Leu Met 
420 

tct cat aac ctg 
Ser His Asn Leu 

tct gaa ttc cac 

Ser Glu Phe His 
455 

cga get ggt gaa 

Arg Ala Gly Glu 
470 
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gaa att age att cag gtg tec aac cct tec ctg ctg gat ccg gat caa 
1492 

Glu lie Ser lie Gin Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Gin 
475 4 80 4 85 

gat gcg acg tac ttt ggg get ttc aaa gtt cag gac ata gac 
1534 

Asp Ala Thr Tyr Phe Gly Ala Phe Lys Val Gin Asp lie Asp 
4 90 4 95 500 

tgagactcat ttcgtggaac attageggee gctaaactat 
1574 



<210> 4 
<211> 501 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
1 5 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 ' 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 " 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly lie 
100 105 " 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn He 
115 120 125 

14 
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Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 ^ 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 " 190 

Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly lie Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 230 235 "* 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

Arg Ala Gin Met Asp Pro Asn Arg lie Ser Glu Asp Ser Thr His Cys 
260 265 ~ 270 

Phe Tyr Arg lie Leu Arg Leu His Glu Asn Ala Gly Leu Gin Asp Ser 
275 280 285 

Thr Leu Glu Ser Glu Asp Thr Leu Pro Asp Ser Cys Arg Arg Met Lys 
1 290 295 300 

Gin Ala Phe Gin Gly Ala Val Gin Lys Glu Leu Gin His lie Val Gly 
305 310 315 320 

Pro Gin Arg Phe Ser Gly Ala Pro Ala Met Met Glu Gly Ser Trp Leu 

325 330 335 

Asp Val Ala Gin Arg Gly Lys Pro Glu Ala Gin Pro Phe Ala His Leu 
340 345 350 

15 
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Thr lie Asn Ala 
355 

Ser Ser Trp Tyr 
370 

Leu Ser Asn Gly 
385 

Tyr Ala Asn lie 



Thr Asp Tyr Leu 
420 

lie Pro Ser Ser 
435 

Ser Gly Asn Ser 
450 

Phe Lys Leu Arg 
465 

Ser Leu Leu Asp 



Val Gin Asp lie 
500 



Ala Ser lie Pro 
360 

His Asp Arg Gly 
375 

Lys Leu Arg Val 
390 

Cys Phe Arg His 
405 

Gin Leu Met Val 



His Asn Leu Met 
440 

Glu Phe His Phe 
455 

Ala Gly Glu Glu 
470 

Pro Asp Gin Asp 
485 

Asp 



Ser Gly Ser His 



Trp Ala Lys lie 
380 

Asn Gin Asp Gly 
395 

His Glu Thr Ser 
410 

Tyr Val Val Lys 
425 

Lys Gly Gly Ser 



Tyr Ser lie Asn 
460 

lie Ser lie Gin 
475 

Ala Thr Tyr Phe 
490 



Lys Val Thr Leu 
365 

Ser Asn Met Thr 



Phe Tyr Tyr Leu 
400 

Gly Ser Val Pro 
415 

Thr Ser lie Lys 
4 30 

Thr Lys Asn Trp 
445 

Val Gly Gly Phe 



Val Ser Asn Pro 
480 

Gly Ala Phe Lys 
495 



<210> 5 
<211> 1477 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD4 0 ligand 

<220> 

16 
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<221> 5'UTR 
<222> (7).. (31) 

<223> 5* UTR from rat surfactant protein D 
<220> 

<221> sig_peptide 
<222> (32) . . (88) 

<223> Signal peptide from murine surfactant protein D 

<220> 

<221> CDS 

<222> (32) . . (1441) 

<220> 

<221> misc_recomb 
<222> (88) . . (799) 

<223> Mature murine surfactant protein D including hub 
region, collagenous portion, and neck, but 
excluding carbohydrate recognition domain (CRD) 

<220> 

<221> misc_feature 
<222> (800) . . (1441) 

<223> Murine CD40 ligand extracellular region, including 
stalk 

<300> 

<301> Motwani, M 
White, R A. 
Guo, N 
Dowler, L L. 
Tauber, A I. 
Motwani, M 

<302> Mouse surfactant protein-D. cDNA cloning, 

characterization, and gene localization to chromosome 
14 . 

<303> J. Immunol . 

<304> 155 

<305> 12 

<306> 5671-5677 

<307> 1995 

<313> 32 TO 800 

17 
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<300> 

<301> Armitage, R 
Fanslow, W 
Sato, T A. 
Clifford, K N. 

<302> Molecular and biological characterization of a murine 

ligand for CD40 
<303> Nature 
<304> 357 
<305> 6373 
<306> 80-82 
<307> 1992 
<313> 801 TO 1441 

<400> 5 

gctagcgaat tccaccagga agcaatctga c atg ctg ccc ttt etc tec atg 
52 

Met Leu Pro Phe Leu Ser Met 
1 5 

ctt gtc ttg ctt gta cag ccc ctg gga aat ctg gga gca gaa atg aag 
100 

Leu Val Leu Leu Val Gin Pro Leu Gly Asn Leu G-y Ala Glu Met Lys 
10 15 20 



age etc teg cag aga tea gta ccc aac acc tgc ace eta gtc atg tgt 
148 

Ser Leu Ser Gin Arg Ser Val Pro Asn Thr Cys Thr Leu Val Met Cys 
25 ^ 30 35 



age cca aca gag 
196 

Ser Pro Thr Glu 
40 

gaa ggt cca egg 
244 

Glu Gly Pro Arg 



ggg etc tea ggg 
292 

Gly Leu Ser Gly 



aat ggc ctg cct 

Asn Gly Leu Pro 
45 

ggt gag aag ggt 

Gly Glu Lys Gly 
60 

ttg cag ggc cct 
Leu Gin Gly Pro 



ggt cgt gat gga 

Gly Arg Asp Gly 
50 

gat cca ggt ttg 

Asp Pro Gly Leu 
65 

aca ggt cca gtt 

Thr Gly Pro Val 
18 



egg gat ggg aga 

Arg Asp Gly Arg 
55 

cca gga cct atg 

Pro Gly Pro Met 
70 

gga ccc aaa gga 
Gly Pro Lys Gly 
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75 

gag aat ggc tct 
340 

Glu Asn Gly Ser 
90 

agt gga cct cca 
388 

Ser Gly Pro Pro 
105 

ggt ccc tct ggg 
436 

Gly Pro Ser Gly 
120 

cct aaa gga gag 
484 

Pro Lys Gly Glu 



caa gga tct aca 
532 

Gin Gly Ser Thr 
155 

ggt gcc cct ggt 
580 

Gly Ala Pro Gly 
170 

cct gcc gga cct 
628 

Pro Ala Gly Pro 
185 

cca gga etc aag 
67 6 

Pro Gly Leu Lys 
200 



get ggc gaa cct 

Ala Gly Glu Pro 
95 

gga ctt cca ggt 

Gly Leu Pro Gly 
110 

aag cag ggg aac 

Lys Gin Gly Asn 
125 

get ggg ccc aaa 

Ala Gly Pro Lys 
140 

ggg gca aaa ggc 

Gly Ala Lys Gly 

gtg caa gga gcc 

Val Gin Gly Ala 
175 

gcc ggt cca cag 

Ala Gly Pro Gin 
190 

ggg gac aga ggt 

Gly Asp Arg Gly 
205 



80 

gga cca aag gga 
Gly Pro Lys Gly 

att cct ggt cca 

lie Pro Gly Pro 
115 

ata gga cct caa 

lie 4 Gly Pro Gin 
130 

gga gaa gta ggt 

Gly Glu Val Gly 
145 

tec aca ggc ccc 

Ser Thr Gly Pro 
160 

cca ggg aat get 
Pro Gly Asn Ala 

gga get cca ggt 

Gly Ala Pro Gly 
195 

gtt cct gga gac 

Val Pro Gly Asp 
210 

19 



85 

gaa cgt gga . eta 

Glu Arg Gly Leu 
100 

get ggg aaa gaa 
Ala Gly Lys Glu 

ggc aaa cca ggt 

Gly Lys Pro Gly 
135 

get cct ggc atg 

Ala Pro Gly Met 
150 

aag gga gaa aga 

Lys Gly Glu Arg 
165 

gga gca gca gga 

Gly Ala Ala Gly 
180 

tec agg ggg ccc 
Ser Arg Gly Pro 

aga gga ate aaa 

Arg Gly lie Lys 
215 
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ggt gaa age ggg 
724 

Gly Glu Ser Gly 



gec tta aaa gga 
772 

Ala Leu Lys Gly 
235 

cag aaa get gca 
820 

Gin Lys Ala Ala 
250 

gaa gag gaa gta 
868 

Glu Glu Glu Val 
265 

aag aga tgc aac 
916 

Lys Arg Cys Asn 
280 

gag atg aga agg 
964 

Glu Met Arg Arg 



aaa gaa gag aaa 
1012 

Lys Glu Glu Lys 
315 

gat cct caa att 
1060 

Asp Pro Gin lie 
330 

gca tec gtt eta 
1108 

Ala Ser Val Leu 



ctt cca gac agt 

Leu Pro Asp Ser 
220 

aaa eta cag cgt 
Lys Leu Gin Arg 

ttg ttc cct gat 

Leu Phe Pro Asp 
255 

aac ctt cat gaa 

Asn Leu His Glu 
270 

aaa gga gaa gga 

Lys Gly Glu Gly 
285 

caa ttt gaa gac 

Gin Phe Glu Asp 
300 

aaa gaa aac age 
Lys Glu Asn Ser 

gca gca cac gtt 

Ala Ala His Val 
335 

cag tgg gee aag 
Gin Trp Ala Lys 



get get ctg agg 

Ala Ala Leu Arg 
225 

eta gag gtt gee 

Leu Glu Val Ala 
240 

ggc cat aga aga 

Gly His Arg Arg 

gat ttt gta ttc 

Asp Phe Val Phe 
275 

tct tta tec ttg 

Ser Leu Ser Leu 
290 

ctt gtc aag gat 

Leu Val Lys Asp 
305 

ttt gaa atg caa 

Phe Glu Met Gin 
320 

gta age gaa gee 
Val Ser Glu Ala 

aaa gga tat tat 

Lys Gly Tyr Tyr 
20 



cag cag atg gag 

Gin Gin Met Glu 
230 

ttc tec cac tat 

Phe Ser His Tyr 
245 

ttg gat aag gtc 

Leu Asp Lys Val 
260 

ata aaa aag eta 

lie Lys Lys Leu 

ctg aac tgt gag 

Leu Asn Cys Glu 
2 95 

ata acg tta aac 

lie Thr Leu Asn 
310 

aga ggt gat gag 

Arg Gly Asp Glu 
325 

aac agt aat gca 

Asn Ser Asn Ala 
340 

ace atg aaa age 
Thr Met Lys Ser 
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345 350 355 

aac ttg gta atg ctt gaa aat ggg aaa cag ctg acg gtt aaa aga gaa 
1156 

Asn Leu Val Met Leu Glu Asn Gly Lys Gin Leu Thr Val Lys Arg Glu 

360 365 370 375 

gga etc tat tat gtc tac act caa gtc acc ttc tgc tct aat egg gag 
1204 

Gly Leu Tyr Tyr Val Tyr Thr Gin Val Thr Phe Cys Ser Asn Arg Glu 

380 385 390 

cct teg agt caa cgc cca ttc ate gtc ggc etc tgg ctg aag ccc age 
1252 

Pro Ser Ser Gin Arg Pro Phe lie Val Gly Leu Trp Leu Lys Pro Ser 

395 400 405 

att gga tct gag aga ate tta etc aag gcg gca aat acc cac agt tec 
1300 

lie Gly Ser Glu Arg lie Leu Leu Lys Ala Ala Asn Thr His Ser Ser 

410 . 415 420 

tec cag ctt tgc gag cag cag tct gtt cac ttg ggc gga gtg ttt gaa 
1348 

Ser Gin Leu Cys Glu Gin Gin Ser Val His Leu Gly Gly Val Phe Glu 

425 430 435 

tta caa get ggt get tct gtg ttt gtc aac gtg act gaa gca age caa 
1396 

Leu Gin Ala Gly Ala Ser Val Phe Val Asn Val Thr Glu Ala Ser Gin 

440 445 450 455 

gtg ate cac aga gtt ggc ttc tea tct ttt ggc tta etc aaa etc 
1441 

Val lie His Arg Val Gly Phe Ser Ser Phe Gly Leu Leu Lys Leu 

4 60 4 65 4 7.0 

tgaacagtgc gctgtcctag getgeagcag ggtacc 
1477 



<210> 6 
<211> 470 

21 
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<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Murine 

surfactant protein D (except CRD) fused to the 
extracellular domain of murine CD40 ligand 

<400> 6 

Met Leu Pro Phe Leu Ser Met Leu Val Leu Leu Val Gin Pro Leu Gly 
15 10 15 

Asn Leu Gly Ala Glu Met Lys Ser Leu Ser Gin Arg Ser Val Pro Asn 
20 25 30 

Thr Cys Thr Leu Val Met Cys Ser Pro Thr Glu Asn Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Glu Gly Pro Arg Gly Glu Lys Gly Asp 
50 55 60 

Pro Gly Leu Pro Gly Pro Met Gly Leu Ser Gly Leu Gin Gly Pro Thr 
65 70 75 80 

Gly Pro Val Gly Pro Lys Gly Glu Asn Gly Ser Ala Gly Glu Pro Gly 

85 90 95 

Pro Lys Gly Glu Arg Gly Leu Ser Gly Pro Pro Gly Leu Pro Gly lie 
100 105 110 

Pro Gly Pro Ala Gly Lys Glu Gly Pro Ser Gly Lys Gin Gly Asn lie 
115 120 125 

Gly Pro Gin Gly Lys Pro Gly Pro Lys Gly Glu Ala Gly Pro Lys Gly 
130 135 140 

Glu Val Gly Ala Pro Gly Met Gin Gly Ser Thr Gly Ala Lys Gly Ser 
145 150 155 160 

Thr Gly Pro Lys Gly Glu Arg Gly Ala Pro Gly Val Gin Gly Ala Pro 

165 170 175 

Gly Asn Ala Gly Ala Ala Gly Pro Ala Gly Pro Ala Gly Pro Gin Gly 
180 185 190 
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Ala Pro Gly Ser Arg Gly Pro Pro Gly Leu Lys Gly Asp Arg Gly Val 
195 200 205 

Pro Gly Asp Arg Gly lie Lys Gly Glu Ser Gly Leu Pro Asp Ser Ala 
210 215 220 

Ala Leu Arg Gin Gin Met Glu Ala Leu Lys Gly Lys Leu Gin Arg Leu 
225 " 230 235 240 

Glu Val Ala Phe Ser His Tyr Gin Lys Ala Ala Leu Phe Pro Asp Gly 

245 250 255 

His Arg Arg Leu Asp Lys Val Glu Glu Glu Val Asn Leu His Glu Asp 
260 265 270 

Phe Val Phe lie Lys Lys Leu Lys Arg Cys Asn Lys Gly Glu Gly Ser 
275 280 285 

Leu Ser Leu Leu Asn Cys Glu Glu Met Arg Arg Gin Phe Glu Asp Leu 
290 295 300 

Val Lys Asp lie Thr Leu Asn Lys Glu Glu Lys Lys Glu Asn Ser Phe 
305 310 315 320 

Glu Met Gin Arg Gly Asp Glu Asp Pro Gin lie Ala Ala His Val Val 

325 330 335 

Ser Glu Ala Asn Ser Asn Ala Ala Ser Val Leu Gin Trp Ala Lys Lys 
340 345 350 

Gly Tyr Tyr Thr Met Lys Ser Asn Leu Val Met Leu Glu Asn Gly Lys 
355 360 365 

Gin Leu Thr Val Lys Arg Glu Gly Leu Tyr Tyr Val Tyr Thr Gin Val 
370 375 380 

Thr Phe Cys Ser Asn Arg Glu Pro Ser Ser Gin Arg Pro Phe lie Val 
385 390 395 400 

Gly Leu Trp Leu Lys Pro Ser lie Gly Ser Glu Arg lie Leu Leu Lys 

405 410 415 

Ala Ala Asn Thr His Ser Ser. Ser Gin Leu Cys Glu Gin Gin Ser Val 
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420 425 430 

His Leu Gly Gly Val Phe Glu Leu Gin Ala Gly Ala Ser Val Phe Val 
435 440 445 

Asn Val Thr Glu Ala Ser Gin Val He His Arg Val Gly Phe Ser Ser 
450 455 460 

Phe Gly Leu Leu Lys Leu 
465 470 
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